Is it time for automated classification of records?
The holy grail of records management is automated classification taking people out of the record retention process and having computers automatically decide which electronic documents are which record types. Quite frankly, in the past, Ive had my doubts on auto-classification of records, but maybe its time to take a...
November 21, 2011 at 07:38 AM
6 minute read
The original version of this story was published on Law.com
The holy grail of records management is automated classification – taking people out of the record retention process and having computers automatically decide which electronic documents are which record types. Quite frankly, in the past, I've had my doubts on auto-classification of records, but maybe it's time to take a new look.
There are problems with traditional record retention approaches for electronic information. One traditional strategy is to have all employees manually classify all electronic records, but this does not work very well. Electronic information is growing about 30 percent to 60 percent per year. Deluged with electronic documents, it takes a lot of time for employees to classify them and many employees tend to put it off. In assessments across a number of companies, we have found these employee-driven, manually-oriented retention programs experience fairly low program compliance, especially for electronic documents. This is a big problem as electronic records typically constitute 90 percent of an organization's records.
The other approach we often see is implementing a monolithic retention strategy, for example saving all email for seven years. As only a fraction of emails are business records, this “save everything” strategy tends to drive significant over-retention, which is not a good thing. Furthermore, monolithic retention policies still don't ensure compliance, as a very small yet material number of critical electronic records need to be saved longer than the monolithic retention period. These records will be under-retained, driving noncompliance.
The record management discipline can benefit from the inroads being created by predictive coding in e-discovery. Predictive coding technologies allow first pass review of documents, doing a pretty good job of determining which documents are relevant. Even with predictive coding, people today still need to re-review the computer's work, but it does make the job easier. The same search and identification technologies currently being used in e-discovery are making their way into record retention products.
Questions arise whether auto-classification is compliant. There is very little case law validating predictive coding, and even less around the sufficiency of automatic classification for records management. Nevertheless, I think we are approaching the point where these auto-classification technologies do an imperfect, but better job of records classification than traditional methods. As record compliance is measured by how well you actually follow your policy, if auto-classification produces better end results than manual classification, one could argue that it is more compliant.
Even with the advent of this new technology, there is no “easy button” for auto-classification. While auto-classification holds the promise of decreasing the amount of time employees spend classifying records, creating a compliant and defensible program actually requires more work in the form of updating policies and creating new processes. These include creating clear, content-based retention schedules, identifying exemplar documents, creating testing programs as well as audit and review, and ensuring overall transparency. Auto-classification will mean doing more work upfront to reduce the amount of work employees will do.
While still in its infancy, auto-classification holds promise of better retention with less effort by employees. I have been a skeptic, but now I think it's worth taking a look.
The holy grail of records management is automated classification – taking people out of the record retention process and having computers automatically decide which electronic documents are which record types. Quite frankly, in the past, I've had my doubts on auto-classification of records, but maybe it's time to take a new look.
There are problems with traditional record retention approaches for electronic information. One traditional strategy is to have all employees manually classify all electronic records, but this does not work very well. Electronic information is growing about 30 percent to 60 percent per year. Deluged with electronic documents, it takes a lot of time for employees to classify them and many employees tend to put it off. In assessments across a number of companies, we have found these employee-driven, manually-oriented retention programs experience fairly low program compliance, especially for electronic documents. This is a big problem as electronic records typically constitute 90 percent of an organization's records.
The other approach we often see is implementing a monolithic retention strategy, for example saving all email for seven years. As only a fraction of emails are business records, this “save everything” strategy tends to drive significant over-retention, which is not a good thing. Furthermore, monolithic retention policies still don't ensure compliance, as a very small yet material number of critical electronic records need to be saved longer than the monolithic retention period. These records will be under-retained, driving noncompliance.
The record management discipline can benefit from the inroads being created by predictive coding in e-discovery. Predictive coding technologies allow first pass review of documents, doing a pretty good job of determining which documents are relevant. Even with predictive coding, people today still need to re-review the computer's work, but it does make the job easier. The same search and identification technologies currently being used in e-discovery are making their way into record retention products.
Questions arise whether auto-classification is compliant. There is very little case law validating predictive coding, and even less around the sufficiency of automatic classification for records management. Nevertheless, I think we are approaching the point where these auto-classification technologies do an imperfect, but better job of records classification than traditional methods. As record compliance is measured by how well you actually follow your policy, if auto-classification produces better end results than manual classification, one could argue that it is more compliant.
Even with the advent of this new technology, there is no “easy button” for auto-classification. While auto-classification holds the promise of decreasing the amount of time employees spend classifying records, creating a compliant and defensible program actually requires more work in the form of updating policies and creating new processes. These include creating clear, content-based retention schedules, identifying exemplar documents, creating testing programs as well as audit and review, and ensuring overall transparency. Auto-classification will mean doing more work upfront to reduce the amount of work employees will do.
While still in its infancy, auto-classification holds promise of better retention with less effort by employees. I have been a skeptic, but now I think it's worth taking a look.
This content has been archived. It is available through our partners, LexisNexis® and Bloomberg Law.
To view this content, please continue to their sites.
Not a Lexis Subscriber?
Subscribe Now
Not a Bloomberg Law Subscriber?
Subscribe Now
NOT FOR REPRINT
© 2024 ALM Global, LLC, All Rights Reserved. Request academic re-use from www.copyright.com. All other uses, submit a request to [email protected]. For more information visit Asset & Logo Licensing.
You Might Like
View AllAfter 2024's Regulatory Tsunami, Financial Services Firms Hope Storm Clouds Break
2024 in Review: Judges Met Out Punishments for Ex-Apple, FDIC, Moody's Legal Leaders
Financial Watchdog Alleges Walmart Forced Army of Gig-Worker Drivers to Receive Pay Through High-Fee Accounts
Trending Stories
Who Got The Work
Michael G. Bongiorno, Andrew Scott Dulberg and Elizabeth E. Driscoll from Wilmer Cutler Pickering Hale and Dorr have stepped in to represent Symbotic Inc., an A.I.-enabled technology platform that focuses on increasing supply chain efficiency, and other defendants in a pending shareholder derivative lawsuit. The case, filed Oct. 2 in Massachusetts District Court by the Brown Law Firm on behalf of Stephen Austen, accuses certain officers and directors of misleading investors in regard to Symbotic's potential for margin growth by failing to disclose that the company was not equipped to timely deploy its systems or manage expenses through project delays. The case, assigned to U.S. District Judge Nathaniel M. Gorton, is 1:24-cv-12522, Austen v. Cohen et al.
Who Got The Work
Edmund Polubinski and Marie Killmond of Davis Polk & Wardwell have entered appearances for data platform software development company MongoDB and other defendants in a pending shareholder derivative lawsuit. The action, filed Oct. 7 in New York Southern District Court by the Brown Law Firm, accuses the company's directors and/or officers of falsely expressing confidence in the company’s restructuring of its sales incentive plan and downplaying the severity of decreases in its upfront commitments. The case is 1:24-cv-07594, Roy v. Ittycheria et al.
Who Got The Work
Amy O. Bruchs and Kurt F. Ellison of Michael Best & Friedrich have entered appearances for Epic Systems Corp. in a pending employment discrimination lawsuit. The suit was filed Sept. 7 in Wisconsin Western District Court by Levine Eisberner LLC and Siri & Glimstad on behalf of a project manager who claims that he was wrongfully terminated after applying for a religious exemption to the defendant's COVID-19 vaccine mandate. The case, assigned to U.S. Magistrate Judge Anita Marie Boor, is 3:24-cv-00630, Secker, Nathan v. Epic Systems Corporation.
Who Got The Work
David X. Sullivan, Thomas J. Finn and Gregory A. Hall from McCarter & English have entered appearances for Sunrun Installation Services in a pending civil rights lawsuit. The complaint was filed Sept. 4 in Connecticut District Court by attorney Robert M. Berke on behalf of former employee George Edward Steins, who was arrested and charged with employing an unregistered home improvement salesperson. The complaint alleges that had Sunrun informed the Connecticut Department of Consumer Protection that the plaintiff's employment had ended in 2017 and that he no longer held Sunrun's home improvement contractor license, he would not have been hit with charges, which were dismissed in May 2024. The case, assigned to U.S. District Judge Jeffrey A. Meyer, is 3:24-cv-01423, Steins v. Sunrun, Inc. et al.
Who Got The Work
Greenberg Traurig shareholder Joshua L. Raskin has entered an appearance for boohoo.com UK Ltd. in a pending patent infringement lawsuit. The suit, filed Sept. 3 in Texas Eastern District Court by Rozier Hardt McDonough on behalf of Alto Dynamics, asserts five patents related to an online shopping platform. The case, assigned to U.S. District Judge Rodney Gilstrap, is 2:24-cv-00719, Alto Dynamics, LLC v. boohoo.com UK Limited.
Featured Firms
Law Offices of Gary Martin Hays & Associates, P.C.
(470) 294-1674
Law Offices of Mark E. Salomone
(857) 444-6468
Smith & Hassler
(713) 739-1250