Home / Blog / Transitioning to a New Backend Pipeline and Data Availability

Transitioning to a New Backend Pipeline and Data Availability

Posted by Chris Ritzo on 2017-05-02
bigquery, data, data analysis, gcs, performance, pipeline, research, platform

M-Lab data is collected from distributed experiments hosted on servers all over the world, processed in a pipeline, and published for free in both raw and parsed (structured) formats. The back end processing component for this has served us well for many years, but it’s been showing its age recently. As M-Lab collects an increasing amount of data thanks to new partnerships, we have been concerned that it will not be as reliable.

To address this, we’ve been working on replacing the aging back-end system with a new cloud based pipeline that will be able to process much higher data volumes with greater transparency and lower latency. As part of the switch from the old pipeline to the new one, we will have a period of time where data will not be published in a regular and timely interval. All data will be published, but we anticipate a period during the transition where data from April 21 to the end of May will not be published until June 1st. Data published before that time will remain available, and experiments will still function as expected despite the delay in data publication. Experiments that collect data through their own independent pipelines will not be affected. We expect that when this transition is complete, experiment data will be published in both raw and parsed formats with no more than 1 day of latency.

We will continue to update you on additional developments on the transition and look forward to posting more exciting information about our new cloud based pipeline.

Archive

June 2026 (1)
March 2026 (1)
February 2026 (1)
December 2025 (2)
November 2025 (2)
July 2025 (2)
June 2025 (2)
May 2025 (1)
April 2025 (2)
October 2024 (1)
August 2024 (1)
June 2024 (2)
March 2024 (2)
January 2024 (1)
December 2023 (1)
September 2023 (1)
August 2023 (1)
July 2023 (2)
June 2023 (2)
May 2023 (1)
April 2023 (2)
January 2023 (1)
November 2022 (1)
September 2022 (2)
August 2022 (2)
July 2022 (3)
June 2022 (2)
May 2022 (1)
March 2022 (2)
February 2022 (6)
January 2022 (3)
December 2021 (1)
November 2021 (1)
October 2021 (1)
August 2021 (1)
July 2021 (2)
June 2021 (1)
March 2021 (3)
February 2021 (2)
January 2021 (1)
November 2020 (1)
September 2020 (2)
August 2020 (2)
July 2020 (4)
May 2020 (4)
April 2020 (4)
March 2020 (1)
January 2020 (2)
December 2019 (1)
November 2019 (1)
October 2019 (3)
September 2019 (1)
August 2019 (1)
July 2019 (3)
June 2019 (1)
May 2019 (1)
April 2019 (2)
March 2019 (5)
February 2019 (2)
January 2019 (1)
November 2018 (1)
October 2018 (2)
September 2018 (1)
July 2018 (3)
April 2018 (1)
February 2018 (1)
January 2018 (2)
August 2017 (1)
May 2017 (1)
April 2017 (1)
March 2017 (1)
November 2016 (1)
October 2016 (1)
May 2016 (1)
March 2016 (1)
January 2016 (1)
June 2015 (1)
April 2015 (1)
February 2015 (1)
November 2014 (2)
October 2014 (2)
March 2014 (2)
November 2013 (1)
September 2013 (1)
August 2013 (1)

GitHub
Twitter
Google
Email
RSS
LinkedIn

Measurement Lab is a fiscally sponsored project of Superbloom

About
Contact
Jobs

Support
Blog
Discuss Group

Code
RSS

Privacy Policy
Acceptable Use Policy

All original material on Measurement Lab is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 4.0 International License. M-Lab is a collaborative effort led by researchers in partnership with companies and other institutions.

We would like to use third party cookies and scripts to improve the functionality of this website.Approve More info

Measurement Lab is led by teams based at Superbloom; Google, Inc; and supported by partners around the world.

Learn more about M-Lab. Get Involved.

Transitioning to a New Backend Pipeline and Data Availability

Categories

Archive