Tracking what data is used in a machine learning data set can be a daunting task, but as regulations and public scrutiny intensify, lawyers say such tracking could be a useful tool for staying compliant.

Last week, Facebook Inc. joined other developers in the quest to better trace the data used in data sets. In a research paper titled "Radioactive Data: Tracing Through Training," Facebook announced a new method for tracing images used in data sets that train software.

To be sure, Facebook isn't the first to announce a method that provides transparency into data sets. In its paper, Facebook highlighted numerous data tracking mechanisms, including watermarking, differential privacy and membership inference.

Lawyers contacted by Legaltech News said confirming whether specific information was used in a data set throughout a software product's development could become necessary as regulatory and public pressure over data privacy grows.

Such information can be leveraged as evidence that an entity isn't compliant with corporate or regulatory privacy policies.

Just this month, Facebook agreed to a $550 million settlement over Biometric Information Privacy Act (BIPA) violations, and Google was served with a similar class action lawsuit over alleged violations of the Illinois law.

However, Georgetown University Law Center professor Anupam Chander noted that figuring out what data is used in a machine learning data set would likely be applied in "constrained circumstances" to ensure a company's data isn't being used without its permission, not to provide transparency to data subjects.

Chander cited the recent news of facial recognition app Clearview scraping billions of images from Facebook, YouTube and Venmo for its law enforcement clientele as the kind of usage companies would want to prevent.

"You see the Clearview data set and Facebook has objected to its use of its data, and so this is another way to demonstrate that Clearview or some third-party vendor used Facebook's images that have been manipulated to make these types of results," he said.

While data tracking methods may help companies follow their data's usage, Chander said Facebook's method may not be helpful for spotting biased data.

"You need to be able to change the underlying data [in order to identify it] without changing the outcome, that's the promise of [Facebook's] paper. … It may not be so easy to change the underlying data without affecting the outcome in substantive ways when it comes to decisions about credit or employment," he said.

Still, as companies grapple with understanding how their data is being used and with potential public backlash, Riesen noted that providing transparency into data sets may leak software insights to competitors.

"This could expose information to a competitor about your proprietary machine learning and AI algorithms that you intended to be a competitive advantage or trade secret. This could lead to competitors studying how your algorithms are processing certain data," he said.

However, only the data sets used by the algorithm would be exposed, not the processes themselves. What's more, when weighing regulatory compliance and public relations against varying privacy regulations, some companies may prefer to make their data sets more traceable, Riesen said.