Transitioning to a New Backend Pipeline and Data Availability

Posted by Chris Ritzo on 2017-05-02
bigquery, data, data analysis, gcs, performance, pipeline, research, platform

M-Lab data is collected from distributed experiments hosted on servers all over the world, processed in a pipeline, and published for free in both raw and parsed (structured) formats. The back end processing component for this has served us well for many years, but it’s been showing its age recently. As M-Lab collects an increasing amount of data thanks to new partnerships, we have been concerned that it will not be as reliable.

Making it Easier to Use M-Lab Data

Posted by Michael Lynch on 2016-03-17
bigquery, gcs, performance, data

In January, M-Lab launched a beta test of new BigQuery tables for M-Lab data. Today, M-Lab is pleased to announce that the beta test was successful. The new, faster-performing tables will be M-Lab’s new standard BigQuery tables.

Before we move on to specifics, when we say faster performing, we mean a lot faster. As in, certain queries that used to take over 2 hours now complete in 8 seconds. That means that playing with the data just became a lot more fun.

To help users dig in to this data as quickly and seamlessly as possible, M-Lab has consolidated all of its data documentation and updated it to show how to take advantage of the new tables.

Visualization (8)
Data analysis (7)
Ripe (4)
Gsoc (1)
Open source (5)
Research (50)
Interconnection (6)
Consumer internet (2)
Performance (15)
Tos (1)
Observatory (5)
Features (1)
Data (61)
Transparency (2)
Bigquery (25)
Gcs (2)
Microbursts (3)
Switch discard (3)
Platform (19)
Dataviz (1)
Tcp (1)
Bbr (5)
Traffic congestion (1)
Paris traceroute (3)
Pipeline (5)
Upgrades (2)
Virtualization (1)
Kernel (5)
Tcp-info (8)
Web100 (2)
Bug (3)
Versioning (1)
Event (20)
Community (47)
Neubot (2)
Igf (1)
Paris-traceroute (3)
Open-source (4)
Speed (3)
Accuracy (2)
Broadband (1)
Mapping (1)
Data-analysis (4)
Schema (3)
Policies (1)
Gdpr (1)
Privacy (3)
Ndt-server (4)
Ndt (20)
Traceroute (9)
Digital inclusion (2)
Survey (1)
Covid19 (5)
Developer (1)
Ndt7 (7)
Gis (1)
Roadmap (1)
Wehe (1)
Policy (1)
Advocacy (1)
Murakami (1)
Ookla (2)
Latency (1)
Bufferbloat (1)
Responsiveness (1)
Governance (1)
Experiment review committee (1)
Fellowship (4)
Announcement (16)
Statistics (1)
Tutorial (2)
Annotations (1)
Cloud (2)
Guest (1)
Hackathon (1)
Leadership (3)
Partnership (4)
Aim (1)
Cloudflare (1)
Ndt5 (1)
Learning (2)
Analysis (2)
Wehe (1)
Bigquery (1)
Annual-review (1)
Internet quality (2)
Infrastructure (1)
Contribution (1)
Censorship measurement (5)
Publication (1)
Ndt7 (1)
Education (1)
Design (1)
Qoe (1)

Measurement Lab is led by teams based at Superbloom; Google, Inc; and supported by partners around the world.

Learn more about M-Lab. Get Involved.

Transitioning to a New Backend Pipeline and Data Availability

Making it Easier to Use M-Lab Data

Categories

Archive