Writing Instrumentation Tests

Old Glean Proverb

If it’s important enough to be instrumented, it’s important enough to be tested.

All metrics and pings in the Glean SDK have well-documented APIs for testing. You’ll want to familiarize yourself with TestGetValue() (here’s an example JS (xpcshell) test of some metrics) for metrics and TestBeforeNextSubmit() (here’s an example C++ (gtest) test of a custom ping) for pings.

All test APIs are available in all three of FOG’s supported languages: Rust, C++, and JavaScript.

But how do you get into a position where you can even call these test APIs? How do they fit in with Firefox Desktop’s testing frameworks?

Manual Testing and Debugging

The Glean SDK has debugging capabilities for manually verifying that instrumentation makes it to Mozilla’s Data Pipeline. Firefox Desktop supports these via environment variables and via the interface on about:glean.

This is all well and good for getting a good sense check that things are going well now, but in order to check that everything stays good through the future, you’re going to want to write some automated tests.

General Things To Bear In Mind

  • You may see values from previous tests persist across tests because the profile directory was shared between test cases.

    • You can reset Glean before your test by calling Services.fog.testResetFOG() (in JS).

      • If your instrumentation isn’t on the parent process, you should call await Services.fog.testFlushAllChildren() before testResetFOG. That will ensure all pending data makes it to the parent process to be cleared.

    • You shouldn’t have to do this in C++ or Rust since there you should use the FOGFixture test fixture.

  • If your metric is based on timing (timespan, timing_distribution), do not expect to be able to assert the correct timing value. Glean does a lot of timing for you deep in the SDK, so unless you mock the system’s monotonic clock, do not expect the values to be predictable.

    • Instead, check that a value is > 0 or that the number of samples is expected.

    • You might be able to assert that the value is at least as much as a known, timed value, but beware of rounding.

    • If your metric is a timing_distribution mirroring to a Telemetry probe via GIFFT, there may be small observed differences between systems that can cause equality assertions to fail.

  • Errors in instrumentation APIs do not panic, throw, or crash. But Glean remembers that the errors happened.

    • Test APIs, on the other hand, are permitted (some may say “encouraged”) to panic, throw, or crash on bad behaviour.

    • If you call a test API and it panics, throws, or crashes, that means your instrumentation did something wrong. Check your test logs for details about what went awry.

Tests and Artifact Builds

Artifact build support is provided by the JOG subsystem. It is able to register the latest versions of all metrics and pings at runtime. However, the compiled code is still running against the version of those metrics and pings that was current at the time the artifacts were compiled.

This isn’t a problem unless:

  • You are changing a metric or ping that is used in instrumentation in the compiled code, or

  • You are using testBeforeNextSubmit in JavaScript for a ping submitted in the compiled code.

When in doubt, simply test your new test in artifact mode (by e.g. passing --enable-artifact-builds to mach try) before submitting it. If it doesn’t pass in artifact mode because of one of these two cases, you may need to skip your test whenever FOG’s artifact build support is enabled:

  • xpcshell:

add_task(
  { skip_if: () => Services.prefs.getBoolPref("telemetry.fog.artifact_build", false) },
  function () {
    // ... your test ...
  }
);
  • mochitest:

add_task(function () {
  if (Services.prefs.getBoolPref("telemetry.fog.artifact_build", false)) {
    Assert.ok(true, "Test skipped in artifact mode.");
    return;
  }
  // ... your test ...
});

The Usual Test Format

Instrumentation tests tend to follow the same three-part format:

  1. Assert no value in the metric

  2. Express behaviour

  3. Assert correct value in the metric

Your choice of test suite will depend on how the instrumented behaviour can be expressed.

xpcshell Tests

If the instrumented behaviour is on the main or content process and can be called from privileged JS, xpcshell is an excellent choice.

xpcshell is so minimal an environment, however, that (pending bug 1756055) you’ll need to manually tell it you need two things:

  1. A profile directory

  2. An initialized FOG

/* Any copyright is dedicated to the Public Domain.
   http://creativecommons.org/publicdomain/zero/1.0/ */

"use strict";

add_setup(function test_setup() {
  // FOG needs a profile directory to put its data in.
  do_get_profile();

  // FOG needs to be initialized in order for data to flow.
  Services.fog.initializeFOG();
});

From there, just follow The Usual Test Format:

add_task(function test_instrumentation() {
  // 1) Assert no value
  Assert.equal(undefined, Glean.myMetricCategory.myMetricName.testGetValue());

  // 2) Express behaviour
  // ...<left as an exercise to the reader>...

  // 3) Assert correct value
  Assert.equal(kValue, Glean.myMetricCategory.myMetricName.testGetValue());
});

If your new instrumentation includes a new custom ping, there are two small additions to The Usual Test Format:

  • 1.1) Call testBeforeNextSubmit before your ping is submitted. The callback you register in testBeforeNextSubmit is called synchronously with the call to the ping’s submit().

  • 3.1) Check that the ping actually was submitted. If all your Asserts are inside testBeforeNextSubmit’s closure, another way this test could pass is by not running any of them.

add_task(function test_custom_ping() {
  // 1) Assert no value
  Assert.equal(undefined, Glean.myMetricCategory.myMetricName.testGetValue());

  // 1.1) Set up Step 3.
  let submitted = false;
  GleanPings.myPing.testBeforeNextSubmit(reason => {
    submitted = true;
    // 3) Assert correct value
    Assert.equal(kExpectedReason, reason, "Reason of submitted ping must match.");
    Assert.equal(kExpectedMetricValue, Glean.myMetricCategory.myMetricName.testGetValue());
  });

  // 2) Express behaviour that sends a ping with expected reason and contents
  // ...<left as an exercise to the reader>...

  // 3.1) Check that the ping actually was submitted.
  Assert.ok(submitted, "Ping was submitted, callback was called.");
});

(( We acknowledge that this isn’t the most ergonomic form. Please follow bug 1756637 for updates on a better design and implementation for ping tests. ))

mochitest

browser-chrome-flavoured mochitests can be tested very similarly to xpcshell, though you do not need to request a profile or initialize FOG. plain-flavoured mochitests can use GleanTest.js. It doesn’t support all the features, only the testResetFOG, testFlushAllChildren (as flush), and testGetValue functions.

Asynchronicity

In GleanTest.js, test functions are all async meaning you need to await their results as they make the trip over IPC and back again.

<!doctype html>
<html>
  <head>
    <meta charset="utf-8" />
    <title>Sample SimpleTest with GleanTest.js</title>
    <script src="/tests/SimpleTest/SimpleTest.js"></script>
    <link rel="stylesheet" href="/tests/SimpleTest/test.css" />
    <script src="/tests/SimpleTest/GleanTest.js"></script>
  </head>
  <body>
    <p id="display"></p>
    <div id="content" style="display: none"></div>
    <pre id="test"></pre>

    <script>
      add_task(async function () {
        await GleanTest.testResetFOG();

        // Run code that sets your metric.

        const value = await GleanTest.myMetricCategory.myMetricName.testGetValue();
      });
    </script>
  </body>
</html>

If you’re testing in mochitest, your instrumentation (or your test) might not be running in the parent process. This means you get to learn the IPC test APIs.

IPC

All test APIs must be called on the main process (they’ll assert otherwise). But your instrumentation might be on any process, so how do you test it?

In this case there’s a slight addition to the Usual Test Format:

  1. Assert no value in the metric

  2. Express behaviour

  3. Flush all pending FOG IPC operations with await Services.fog.testFlushAllChildren()

  4. Assert correct value in the metric.

NOTE: We learned in bug 1843178 that the list of all content processes that Services.fog.testFlushAllChildren() uses is very quickly updated after the end of a call to BrowserUtils.withNewTab(...). If you are using withNewTab, you should consider calling testFlushAllChildren() within the callback.

GTests/Google Tests

Please make use of the FOGFixture fixture when writing your tests, like:

TEST_F(FOGFixture, MyTestCase) {
  // 1) Assert no value
  ASSERT_EQ(mozilla::Nothing(),
            my_metric_category::my_metric_name.TestGetValue());

  // 2) Express behaviour
  // ...<left as an exercise to the reader>...

  // 3) Assert correct value
  ASSERT_EQ(kValue,
            my_metric_category::my_metric_name.TestGetValue().unwrap().ref());
}

The fixture will take care of ensuring storage is reset between tests.

Rust rusttests

The general-purpose Testing & Debugging Rust Code in Firefox is a good thing to review first.

Unfortunately, FOG requires gecko (to tell it where the profile dir is, and other things), which means we need to use the GTest + FFI approach where GTest is the runner and Rust is just the language the test is written in.

This means your test will look like a GTest like this:

extern "C" void Rust_MyRustTest();
TEST_F(FOGFixture, MyRustTest) { Rust_MyRustTest(); }

Plus a Rust test like this:

#[no_mangle]
pub extern "C" fn Rust_MyRustTest() {
    // 1) Assert no value
    assert_eq!(None,
               fog::metrics::my_metric_category::my_metric_name.test_get_value(None));

    // 2) Express behaviour
    // ...<left as an exercise to the reader>...

    // 3) Assert correct value
    assert_eq!(Some(value),
               fog::metrics::my_metric_category::my_metric_name.test_get_value(None));
}