Industry perspective: Engineering with a Lean Data Diet in mind

Andrew Moon
February 19th, 2021 · 1 min read

The Lean Data Diet didn’t have a catchy name until recently, but it’s a methodology that Nuria Ruiz and other engineers at the Wikimedia Foundation and Wikipedia have been practicing for a long time.

“This idea of the Lean Data Diet is…that you should be very purposeful with the data you gather and make sure that it truly enhances the product and the value that you’re trying to provide to your customers,” she explained at a recent privacy_infra() event.

Wikipedia is built on the idea of a Free Knowledge Movement, and its engineers believe “there cannot be access to free knowledge without a strong guarantee of privacy.” According to Ruiz, a volunteer engineer with the Wikimedia Foundation, they achieve privacy through three engineering measures: deleting data at scale, sanitizing data, and building a privacy culture.

For example, deleting data at scale is traditionally a dangerous undertaking that cannot be undone, but Wikipedia has developed a system with a built-in safety net to prevent unintended data loss.

They rely on a final checksum argument determined early in the deletion process to verify that the final data set is correct. “That way, if you want to schedule the deletion…and instead of saying older than 90, you say older than nine…nothing will happen because the arguments do not match the checksum.”

“[In my] experience as an engineer, when you do simple solutions, very low tech, they are alive for a long time.”
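
The checksum safeguard described above can be sketched in a few lines of Python. This is only an illustration of the general idea, not Wikimedia's actual tooling: the function names, the `pageviews` dataset name, the SHA-256 hash, and the in-memory record list are all assumptions made for the example.

```python
import hashlib
from datetime import date, timedelta


def args_checksum(dataset: str, older_than_days: int) -> str:
    """Hash the deletion arguments into a short checksum (hypothetical helper)."""
    payload = f"{dataset}:{older_than_days}".encode("utf-8")
    return hashlib.sha256(payload).hexdigest()[:12]


def delete_older_than(records, dataset, older_than_days, expected_checksum, today):
    """Return the records that survive deletion.

    If the arguments do not hash to expected_checksum, nothing is deleted.
    """
    if args_checksum(dataset, older_than_days) != expected_checksum:
        # The arguments differ from what was approved (e.g. 9 instead of 90),
        # so refuse to delete anything.
        return list(records)
    cutoff = today - timedelta(days=older_than_days)
    return [r for r in records if r["day"] >= cutoff]


# The checksum is fixed early, when the deletion job is first defined.
approved = args_checksum("pageviews", 90)

today = date(2021, 2, 19)
records = [
    {"day": today - timedelta(days=120)},
    {"day": today - timedelta(days=30)},
    {"day": today - timedelta(days=5)},
]

# Typo: 9 instead of 90. The checksum does not match, so nothing is removed.
kept_after_typo = delete_older_than(records, "pageviews", 9, approved, today)

# Correct arguments: only the 120-day-old record is dropped.
kept_after_valid = delete_older_than(records, "pageviews", 90, approved, today)
```

The design choice here is that a mistyped argument fails closed: the job does nothing rather than deleting more than intended.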

In her full talk from privacy_infra(), which you can watch below, she expands on how the team of volunteer software engineers sanitizes data and has built a privacy culture using the Lean Data Diet methodology.

“Privacy is itself a feature. It is not something that we do, but it is part of our product offering.”

Note: This post reflects information and opinions shared by speakers at Transcend’s ongoing privacy_infra() event series, which features industry-wide tech talks highlighting new thinking in data privacy engineering every other month.

If you’re working on solving universal privacy challenges and are interested in speaking about it, submit a proposal to speak at an upcoming event here.

