page-mod
Overview
The page-mod module enables add-on developers to execute scripts in the context of specific web pages. Most obviously you could use page-mod to dynamically modify the content of certain pages.
The module exports a constructor function PageMod
which creates a new page
modification (or "mod" for short).
A page mod does not modify its pages until those pages are loaded or reloaded. In other words, if your add-on is loaded while the user's browser is open, the user will have to reload any open pages that match the mod for the mod to affect them.
To stop a page mod from making any more modifications, call its destroy
method.
Like all modules that interact with web content, page-mod uses content scripts that execute in the content process and defines a messaging API to communicate between the content scripts and the main add-on script. For more details on content scripting see the tutorial on interacting with web content.
To create a PageMod the add-on developer supplies:
-
a set of rules to select the desired subset of web pages based on their URL. Each rule is specified using the match-pattern syntax.
-
a set of content scripts to execute in the context of the desired pages.
-
a value for the onAttach option: this value is a function which will be called when a page is loaded that matches the ruleset. This is used to set up a communication channel between the add-on code and the content script.
All these parameters are optional except for the ruleset, which must include at least one rule.
The following add-on displays an alert whenever a page matching the ruleset is loaded:
var pageMod = require("page-mod");
pageMod.PageMod({
include: "*.org",
contentScript: 'window.alert("Page matches ruleset");'
});
If you specify a value of "ready" or "end" for contentScriptWhen
,
as opposed to "start",
then the content script can interact with the DOM itself:
var pageMod = require("page-mod");
pageMod.PageMod({
include: "*.org",
contentScriptWhen: 'end',
contentScript: 'document.body.innerHTML = ' +
' "<h1>Page matches ruleset</h1>";'
});
Using contentScriptFile
Most of the examples in this page define content scripts as strings,
and use the contentScript
option to assign them to page mods.
Alternatively, you can create content scripts in separate files
under your add-on's data
directory. Then you can use the
self
module to retrieve a URL pointing
to the file, and assign this to the page-mod's contentScriptFile
property.
For example, if you save the content script
file in your data
directory as "myScript.js", you would assign it using
code like:
var data = require("self").data;
var pageMod = require("page-mod");
pageMod.PageMod({
include: "*.org",
contentScriptWhen: 'end',
contentScriptFile: data.url("myScript.js")
});
Unless your content script is extremely simple and consists only of a
static string, don't use contentScript
: if you do, you may
have problems getting your add-on approved on AMO.
Instead, keep the script in a separate file and load it using
contentScriptFile
. This makes your code easier to maintain,
secure, debug and review.
Styling web pages
Sometimes adding a script to web pages is not enough, you also want to style
them. PageMod
provides an easy way to do that through options' contentStyle
and contentStyleFile
properties:
var data = require("self").data;
var pageMod = require("page-mod");
pageMod.PageMod({
include: "*.org",
contentStyleFile: data.url("my-page-mod.css"),
contentStyle: [
"div { padding: 10px; border: 1px solid silver}",
"img { display: none}"
]
})
PageMod
will add these styles as
user style sheets.
Working with Relative URLs in CSS Rules
You can't currently use relative URLs in style sheets loaded in this way. For example, consider a CSS file that references an external file like this:
background: rgb(249, 249, 249) url('../../img/my-background.png') repeat-x top center;
If you attach this file using contentStyleFile
, "my-background.png"
will not be found.
As a workaround for this, you can build an absolute URL to a file in your
"data" directory, and construct a contentStyle
option embedding that URL
in your rule. For example:
var data = require("self").data;
var pageMod = require("page-mod").PageMod({
include: "*",
contentStyleFile: data.url("my-style.css"),
// contentStyle is built dynamically here to include an absolute URL
// for the file referenced by this CSS rule.
// This workaround is needed because we can't use relative URLs
// in contentStyleFile or contentStyle.
contentStyle: "h1 { background: url(" + data.url("my-pic.jpg") + ")}"
});
This add-on uses a separate file "my-style.css", loaded using
contentStyleFile
, for all CSS rules except those that reference
an external file. For the rule that needs to refer to "my-pic.jpg",
which is stored in the add-on's "data" directory, it uses contentStyle
.
Dynamically constructing code strings like those assigned to contentScript
or contentStyle
is usually considered an unsafe practice that may cause an
add-on to fail AMO review. In this case it is safe, and should be allowed,
but including a comment like that in the example above will help to
prevent any misunderstanding.
Communicating With Content Scripts
When a matching page is loaded the PageMod
will call the function that the
add-on code supplied to onAttach
. The PageMod
supplies one argument to
this function: a worker
object.
The worker can be thought of as the add-on's end of a communication channel between the add-on code and the content scripts that have been attached to this page.
Thus the add-on can pass messages to the content scripts by calling the
worker's postMessage
function and can receive messages from the content
scripts by registering a function as a listener to the worker's on
function.
Note that if multiple matching pages are loaded simultaneously then each page
is loaded into its own execution context with its own copy of the content
scripts. In this case onAttach
is called once for each loaded page, and the
add-on code will have a separate worker for each page:
This is demonstrated in the following example:
var pageMod = require("page-mod");
var tabs = require("tabs");
var workers = [];
pageMod.PageMod({
include: ["http://www.mozilla*"],
contentScriptWhen: 'end',
contentScript: "onMessage = function onMessage(message) {" +
" window.alert(message);};",
onAttach: function onAttach(worker) {
if (workers.push(worker) == 3) {
workers[0].postMessage("The first worker!");
workers[1].postMessage("The second worker!");
workers[2].postMessage("The third worker!");
}
}
});
tabs.open("http://www.mozilla.com");
tabs.open("http://www.mozilla.org");
tabs.open("http://www.mozilla-europe.org");
Here we specify a ruleset to match any URLs starting with "http://www.mozilla". When a page matches we add the supplied worker to an array, and when we have three workers in the array we send a message to each worker in turn, telling it the order in which it was attached. The worker just displays the message in an alert box.
This shows that separate pages execute in separate contexts and that each context has its own communication channel with the add-on script.
Note though that while there is a separate worker for each execution context,
the worker is shared across all the content scripts associated with a single
execution context. In the following example we pass two content scripts into
the PageMod
: these content scripts will share a worker instance.
In the example each content script identifies itself to the add-on script
by sending it a message using the global postMessage
function. In the
onAttach
function the add-on code logs the fact that a new page is
attached and registers a listener function that simply logs the message:
var pageMod = require("page-mod");
var data = require("self").data;
var tabs = require("tabs");
pageMod.PageMod({
include: ["http://www.mozilla*"],
contentScriptWhen: 'end',
contentScript: ["postMessage('Content script 1 is attached to '+ " +
"document.URL);",
"postMessage('Content script 2 is attached to '+ " +
"document.URL);"],
onAttach: function onAttach(worker) {
console.log("Attaching content scripts")
worker.on('message', function(data) {
console.log(data);
});
}
});
tabs.open("http://www.mozilla.com");
The console output of this add-on is:
info: Attaching content scripts info: Content script 1 is attached to http://www.mozilla.com/en-US/ info: Content script 2 is attached to http://www.mozilla.com/en-US/
Mapping workers to tabs
The worker
has a tab
property which returns the tab associated with this worker. You can use this
to access the tabs API
for the tab
associated with a specific page:
var pageMod = require("page-mod");
var tabs = require("tabs");
pageMod.PageMod({
include: ["*"],
onAttach: function onAttach(worker) {
console.log(worker.tab.title);
}
});
Attaching content scripts to tabs
We've seen that the page mod API attaches content scripts to pages based on their URL. Sometimes, though, we don't care about the URL: we just want to execute a script on demand in the context of a particular tab.
For example, we might want to run a script in the context of the currently active tab when the user clicks a widget: to block certain content, to change the font style, or to display the page's DOM structure.
Using the attach
method of the tab
object, you can attach a set of content scripts to a particular tab. The
scripts are executed immediately.
The following add-on creates a widget which, when clicked, highlights all the
div
elements in the page loaded into the active tab:
var widgets = require("widget");
var tabs = require("tabs");
var widget = widgets.Widget({
id: "div-show",
label: "Show divs",
contentURL: "http://www.mozilla.org/favicon.ico",
onClick: function() {
tabs.activeTab.attach({
contentScript:
'var divs = document.getElementsByTagName("div");' +
'for (var i = 0; i < divs.length; ++i) {' +
'divs[i].setAttribute("style", "border: solid red 1px;");' +
'}'
});
}
});
Destroying Workers
Workers generate a detach
event when their associated page is closed: that
is, when the tab is closed or the tab's location changes. If
you are maintaining a list of workers belonging to a page mod, you can use
this event to remove workers that are no longer valid.
For example, if you maintain a list of workers attached to a page mod:
var workers = [];
var pageMod = require("page-mod").PageMod({
include: ['*'],
contentScriptWhen: 'ready',
contentScriptFile: data.url('pagemod.js'),
onAttach: function(worker) {
workers.push(worker);
}
});
You can remove workers when they are no longer valid by listening to detach
:
var workers = [];
function detachWorker(worker, workerArray) {
var index = workerArray.indexOf(worker);
if(index != -1) {
workerArray.splice(index, 1);
}
}
var pageMod = require("page-mod").PageMod({
include: ['*'],
contentScriptWhen: 'ready',
contentScriptFile: data.url('pagemod.js'),
onAttach: function(worker) {
workers.push(worker);
worker.on('detach', function () {
detachWorker(this, workers);
});
}
});
API Reference
Classes
PageMod
A PageMod object. Once activated a page mod will execute the supplied content scripts in the context of any pages matching the pattern specified by the 'include' property.
Constructors
PageMod(options)
Creates a PageMod.
Options for the PageMod, with the following keys:
A match pattern string or an array of match pattern strings. These define the pages to which the PageMod applies. See the match-pattern module for a description of match pattern syntax. At least one match pattern must be supplied.
The local file URLs of content scripts to load. Content scripts specified
by this option are loaded before those specified by the contentScript
option. Optional.
The texts of content scripts to load. Content scripts specified by this
option are loaded after those specified by the contentScriptFile
option.
Optional.
When to load the content scripts. This may take one of the following values:
- "start": load content scripts immediately after the document element for the page is inserted into the DOM, but before the DOM content itself has been loaded
- "ready": load content scripts once DOM content has been loaded, corresponding to the DOMContentLoaded event
- "end": load content scripts once all the content (DOM, JS, CSS, images) for the page has been loaded, at the time the window.onload event fires
This property is optional and defaults to "end".
Read-only value exposed to content scripts under self.options
property.
Any kind of jsonable value (object, array, string, etc.) can be used here. Optional.
The local file URLs of stylesheet to load. Content style specified by this
option are loaded before those specified by the contentStyle
option.
Optional.
The texts of stylesheet rules to add. Content styles specified by this
option are loaded after those specified by the contentStyleFile
option.
Optional.
A function to call when the PageMod attaches content scripts to
a matching page. The function will be called with one argument, a worker
object which the add-on script can use to communicate with the content scripts
attached to the page in question.
Methods
destroy()
Stops the page mod from making any more modifications. Once destroyed the page
mod can no longer be used. Note that modifications already made to open pages
will not be undone, except for any stylesheet added by contentStyle
or
contentStyleFile
, that are unregistered immediately.
Properties
include : List
A list of match pattern strings. These
define the pages to which the page mod applies. See the
match-pattern module for a
description of match patterns. Rules can be added to the list by calling its
add
method and removed by calling its remove
method.
Events
attach
This event is emitted this event when the page-mod's content scripts are
attached to a page whose URL matches the page-mod's include
filter.
The listener function is passed a Worker
object that can be used to communicate
with any content scripts attached to this page.
error
This event is emitted when an uncaught runtime error occurs in one of the page mod's content scripts.
Listeners are passed a single argument, the Error object.