page-worker
The page-worker
module provides a way to create a permanent, invisible page
and access its DOM.
The module exports a constructor function Page
, which constructs a new page
worker. A page worker may be destroyed, after which its memory is freed, and
you must create a new instance to load another page.
You specify the page to load using the contentURL
option to the
Page()
constructor.
This can point to a remote file:
pageWorker = require("page-worker").Page({
contentScript: "console.log(document.body.innerHTML);",
contentURL: "http://en.wikipedia.org/wiki/Internet"
});
It can also point to an HTML file which you've packaged with your add-on.
To do this, save the file in your add-on's data
directory and create the
URL using the data.url()
method of the
self
module:
pageWorker = require("page-worker").Page({
contentScript: "console.log(document.body.innerHTML);",
contentURL: require("self").data.url("myFile.html")
});
You can load a new page by setting the page worker's contentURL
property.
In this example we fetch the first paragraph of a page from Wikipedia, then
the first paragraph of a different page:
var getFirstParagraph = "var paras = document.getElementsByTagName('p');" +
"console.log(paras[0].textContent);" +
"self.port.emit('loaded');"
pageWorker = require("page-worker").Page({
contentScript: getFirstParagraph,
contentURL: "http://en.wikipedia.org/wiki/Chalk"
});
pageWorker.port.on("loaded", function() {
pageWorker.contentURL = "http://en.wikipedia.org/wiki/Cheese"
});
Scripting Page-Worker Content
To access the page's DOM you need to attach a script to it. In the SDK these scripts are called "content scripts" because they're explicitly used for interacting with web content.
You can specify one or more content scripts to load into the page using the
contentScript
or contentScriptFile
options to the
Page()
constructor.
With contentScript
you pass the script as a string, as in the examples
above. With contentScriptFile
you pass a URL which points to a script
saved under your add-on's data
directory. You construct the URL using
the data.url()
method of the
self
module.
While content scripts can access DOM content, they can't access any of the SDK APIs, so in many cases you'll need to exchange messages between the content script and your main add-on code for a complete solution.
For example, the content script might read some content and send it back to
the main add-on, which could store it using the
simple-storage
API. You can
communicate with the script using either the
postMessage()
API or (preferably, usually) the
port
API.
For example, this add-on loads a page from Wikipedia, and runs a content script in it to send all the headers back to the main add-on code:
var pageWorkers = require("page-worker");
// This content script sends header titles from the page to the add-on:
var script = "var elements = document.querySelectorAll('h2 > span'); " +
"for (var i = 0; i < elements.length; i++) { " +
" postMessage(elements[i].textContent) " +
"}";
// Create a page worker that loads Wikipedia:
pageWorkers.Page({
contentURL: "http://en.wikipedia.org/wiki/Internet",
contentScript: script,
contentScriptWhen: "ready",
onMessage: function(message) {
console.log(message);
}
});
For conciseness, this example creates the content script as a string and uses
the contentScript
property. In your own add-ons, you will probably want to
create your content scripts in separate files and pass their URLs using the
contentScriptFile
property.
Unless your content script is extremely simple and consists only of a
static string, don't use contentScript
: if you do, you may
have problems getting your add-on approved on AMO.
Instead, keep the script in a separate file and load it using
contentScriptFile
. This makes your code easier to maintain,
secure, debug and review.
To learn much more about content scripts, see the Working with Content Scripts guide.
Scripting Trusted Page Content
Note that the feature described in this section is experimental: we'll
very probably continue to support it, but the name of the addon
property might change in a future release.
We've already seen that you can package HTML files in your add-on's data
directory and load them using page-worker
. We can call this "trusted"
content, because unlike content loaded from a source outside the
add-on, the add-on author knows exactly what it's doing. To
interact with trusted content you don't need to use content scripts:
you can just include a script from the HTML file in the normal way, using
<script>
tags.
Like a content script, these scripts can communicate with the add-on code
using the
postMessage()
API or the
port
API.
The crucial difference is that these scripts access the postMessage
and port
objects through the addon
object, whereas content scripts
access them through the self
object.
So given an add-on that loads trusted content and uses content scripts to access it, there are typically three changes you have to make, if you want to use normal page scripts instead:
-
in the content script: change occurrences of
self
toaddon
. For example,self.port.emit("my-event")
becomesaddon.port.emit("my-event")
. -
in the HTML page itself: add a
<script>
tag to load the script. So if your content script is saved underdata
as "my-script.js", you need a line like<script src="my-script.js"></script>
in the page header. -
in the "main.js" file: remove the
contentScriptFile
option in thePage()
constructor.
API Reference
Classes
Page
A Page
object loads the page specified by its contentURL
option and
executes any content scripts that have been supplied to it in the
contentScript
and contentScriptFile
options.
The page is not displayed to the user.
The page worker is loaded as soon as the Page
object is created and stays
loaded until its destroy
method is called or the add-on is unloaded.
Constructors
Page(options)
Creates an uninitialized page worker instance.
The options
parameter is optional, and if given it should be an object
with any of the following keys:
The URL of the content to load in the panel.
An object with keys to configure the permissions on the page worker. The
boolean key script
controls if scripts from the page are allowed to run.
script
defaults to true.
A local file URL or an array of local file URLs of content scripts to load.
Content scripts specified by this option are loaded before those specified
by the contentScript
option. See
Working with Content Scripts
for help on setting this property.
A string or an array of strings containing the texts of content scripts to
load. Content scripts specified by this option are loaded after those
specified by the contentScriptFile
option.
When to load the content scripts. This may take one of the following values:
- "start": load content scripts immediately after the document element for the page is inserted into the DOM, but before the DOM content itself has been loaded
- "ready": load content scripts once DOM content has been loaded, corresponding to the DOMContentLoaded event
- "end": load content scripts once all the content (DOM, JS, CSS, images) for the page has been loaded, at the time the window.onload event fires
This property is optional and defaults to "end".
Read-only value exposed to content scripts under self.options
property.
Any kind of jsonable value (object, array, string, etc.) can be used here. Optional.
Use this to add a listener to the page worker's message
event.
Methods
destroy()
Unloads the page worker. After you destroy a page worker, its memory is freed and you must create a new instance if you need to load another page.
postMessage(message)
Sends a message to the content scripts.
The message to send. Must be JSON-able.
on(type, listener)
Registers an event listener with the page worker. See Working with Events for help with events.
The type of event to listen for.
The listener function that handles the event.
removeListener(type, listener)
Unregisters an event listener from the page worker.
The type of event for which listener
was registered.
The listener function that was registered.
Properties
port : EventEmitter
EventEmitter object that allows you to:
- send events to the content script using the
port.emit
function - receive events from the content script using the
port.on
function
See the guide to
communicating using port
for details.
contentURL : string
The URL of content to load. This can point to local content loaded from your add-on's "data" directory or remote content. Setting it loads the content immediately.
allow : object
A object describing permissions for the content. It contains a single key
named script
whose value is a boolean that indicates whether or not to
execute script in the content. script
defaults to true.
contentScriptFile : string,array
A local file URL or an array of local file URLs of content scripts to load.
contentScript : string,array
A string or an array of strings containing the texts of content scripts to load.
contentScriptWhen : string
When to load the content scripts. This may have one of the following values:
- "start": load content scripts immediately after the document element for the page is inserted into the DOM, but before the DOM content itself has been loaded
- "ready": load content scripts once DOM content has been loaded, corresponding to the DOMContentLoaded event
- "end": load content scripts once all the content (DOM, JS, CSS, images) for the page has been loaded, at the time the window.onload event fires
contentScriptOptions : object
Read-only value exposed to content scripts under self.options
property.
Any kind of jsonable value (object, array, string, etc.) can be used here. Optional.
Events
message
If you listen to this event you can receive message events from content
scripts associated with this page worker. When a content script posts a
message using self.postMessage()
, the message is delivered to the add-on
code in the page worker's message
event.
Listeners are passed a single argument which is the message posted from the content script. The message can be any JSON-serializable value
error
This event is emitted when an uncaught runtime error occurs in one of the page worker's content scripts.
Listeners are passed a single argument, the Error object.