In most litigation matters requiring complex data analytics, our consultants receive vast quantities of data with little to no information about its contents. The volume of data alone makes comprehensive manual review nearly impossible, especially under the tight time constraints of many litigation scenarios. While data is often critical to mounting a successful legal strategy, large quantities of data without context create an impenetrable black box, obscuring the calculation processes behind it.

Asking the Right Questions

Effective analytics requires a thorough understanding of the data being analyzed, which can be achieved through exploratory data analysis (EDA). EDA starts with basic questions that assess a data set's structure and quality and inform a plan of attack. These questions include:

  • Is there an existing unique record ID?
  • Are fields populated with numbers, dates, or alpha-numeric characters?
  • What is the distribution of numeric values?
  • How many values exist for categorical fields?
  • How many fields have been left blank?
  • How are multiple data sets related?
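The initial questions above can be sketched in a few lines of Python with pandas; the small inline data set and its field names are purely illustrative, standing in for data that would arrive from a client:

```python
import pandas as pd

# Illustrative sample; in practice this would be the produced data set
df = pd.DataFrame({
    "entry_id": [101, 102, 103, 104],
    "amount":   [250.0, -40.0, 1200.0, None],
    "account":  ["cash", "cash", "payable", "payable"],
})

# Is there an existing unique record ID?
id_candidates = [c for c in df.columns if df[c].is_unique]

# Are fields populated with numbers, dates, or alpha-numeric characters?
field_types = df.dtypes

# What is the distribution of numeric values?
numeric_summary = df.describe()

# How many values exist for categorical fields?
category_counts = df.select_dtypes(include="object").nunique()

# How many fields have been left blank?
blank_counts = df.isna().sum()
```

Each question maps to a one-line check, which is what makes this stage of EDA so amenable to automation.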

Once this initial assessment is complete, EDA continues with questions that probe for illogical conditions, which indicate the overall quality and reliability of the data. Some hypothetical examples of such questions are:

  • Why are there accounting entries with date stamps that precede the existence of the company being analyzed?
  • Why are 95 percent of the records blank for a field?
  • Why is one accounting entry 1,000 times the size of the next largest?
  • Why is a small number of values negative rather than positive for a specific field?
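Checks for illogical conditions like those above can also be automated. A minimal sketch, again using pandas; the ledger values, the founding date, and the thresholds are hypothetical:

```python
import pandas as pd

# Illustrative ledger; the founding date is a hypothetical engagement fact
df = pd.DataFrame({
    "entry_date": pd.to_datetime(
        ["1985-06-01", "2012-03-15", "2013-07-09", "2014-01-02"]),
    "amount": [120.0, 95.0, -30.0, 150000.0],
    "memo": [None, None, None, None],
})
founding_date = pd.Timestamp("1990-01-01")

# Entries date-stamped before the company existed
pre_founding = df[df["entry_date"] < founding_date]

# Fields that are blank in 95 percent or more of records
mostly_blank = [c for c in df.columns if df[c].isna().mean() >= 0.95]

# Is the largest entry orders of magnitude beyond the next largest?
top_two = df["amount"].abs().nlargest(2)
outlier_ratio = top_two.iloc[0] / top_two.iloc[1]

# How many values are negative where positives might be expected?
negative_count = (df["amount"] < 0).sum()
```

Each flagged condition becomes a question to raise with the client rather than a conclusion in itself.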

Answering these questions is a simple endeavor when dealing with a small number of fields and records; however, the task becomes much more complex and time-consuming when dealing with millions of records and hundreds of fields. When data analytics in a litigation setting requires precision under accelerated timelines, EDA becomes critical to delivering timely analysis.

Applying the Right Technology

Data analytics experts use computer code that can be deployed on any data set, speeding up the EDA process and enhancing efficiency without sacrificing precision. Custom coding systems can be connected to data sets, automating a large portion of the EDA process. Since data can be stored in many ways, these custom coding systems are flexible enough to access information across a variety of formats including:

  • Numerous relational database management systems (Microsoft SQL Server, Oracle Database, MySQL, etc.)
  • Text files
  • Microsoft Excel workbooks

Once the program is connected to a data set, the information can be fed into the system in a standard format. Standardization allows these coding systems to run the same set of analyses regardless of how the data is natively stored.
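The standardization step can be illustrated with pandas, which loads relational databases, text files, and Excel workbooks into one common structure. In this sketch, an in-memory SQLite database stands in for a production database server and an in-memory buffer stands in for a text file on disk; table and column names are illustrative:

```python
import io
import sqlite3

import pandas as pd

# A relational database (SQLite stands in for SQL Server, Oracle, MySQL, etc.)
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE entries (entry_id INTEGER, amount REAL)")
conn.execute("INSERT INTO entries VALUES (1, 250.0), (2, -40.0)")
from_db = pd.read_sql("SELECT * FROM entries", conn)

# A delimited text file (an in-memory buffer stands in for a file on disk)
text = io.StringIO("entry_id,amount\n3,1200.0\n4,15.5\n")
from_text = pd.read_csv(text)

# Excel workbooks would load the same way via pd.read_excel(...)

# Once standardized, identical analyses apply to every source
combined = pd.concat([from_db, from_text], ignore_index=True)
```

Because every source lands in the same structure, the EDA code written once runs unchanged against all of them.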

One best practice is to create a reporting document that can be distributed to both technical and non-technical personnel, thus increasing the number of people who have "eyes on the data." Review of the standard reporting document reveals important features of a data set and helps raise critical questions about illogical data conditions. Further, it can quickly reveal columns with less importance, allowing analysts to narrow their focus to the most impactful fields, creating downstream efficiencies during production of the analysis at hand.
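A standard reporting document of this kind might contain one row per field summarizing type, blanks, and distinct values. A minimal sketch in pandas, with illustrative data and a hypothetical 95 percent blank threshold for deprioritizing fields:

```python
import pandas as pd

# Hypothetical data set
df = pd.DataFrame({
    "entry_id": [1, 2, 3, 4],
    "amount": [250.0, None, 1200.0, -40.0],
    "notes": [None, None, None, None],
})

# One row per field: type, blanks, distinct values -- readable by
# technical and non-technical reviewers alike
report = pd.DataFrame({
    "dtype": df.dtypes.astype(str),
    "blank_count": df.isna().sum(),
    "blank_pct": (df.isna().mean() * 100).round(1),
    "distinct_values": df.nunique(),
})

# Fields that carry little information can be deprioritized early
low_value_fields = report.index[report["blank_pct"] >= 95].tolist()

# report.to_csv("eda_report.csv")  # distribute to the review team
```

Exporting the report to a spreadsheet or CSV keeps it accessible to reviewers who never touch the code.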

Analyzing a Company's Data

Folding EDA into a broader engagement strategy is critical to the success of projects that require analytics on complex data sets. Without the capabilities of automated EDA, the task of deriving insights across a high number of variables of unknown type and meaning would be challenging and time-consuming.

Deploying automated EDA tools as soon as a company's data set is acquired, and reviewing the standard reporting packages they produce, should be required steps. That review may reveal that, while the data set appeared robust at the outset, many fields are not viable for analysis. Review of date fields may also reveal that large, critical portions of a data set are missing. These issues, while quickly and painlessly resolved with EDA, could invalidate a large amount of work if left undiscovered until later. By reviewing reporting packages with the client, the EDA process can identify and fill gaps in the data, focus the analysis, and provide valuable, timely insights.
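One way such a date-field review can surface missing portions of a data set is to count records per calendar month; months with zero records flag gaps in coverage. A sketch with hypothetical transaction dates:

```python
import pandas as pd

# Hypothetical transaction dates with a suspicious gap in coverage
dates = pd.to_datetime([
    "2019-01-05", "2019-01-20", "2019-02-11",
    "2019-06-03", "2019-06-28", "2019-07-14",
])
df = pd.DataFrame({"entry_date": dates})

# Records per calendar month; empty months indicate missing data
monthly = df.set_index("entry_date").resample("MS").size()
missing_months = monthly[monthly == 0].index.strftime("%Y-%m").tolist()
```

A gap like this, surfaced on day one, becomes a targeted request to the client rather than a late-stage surprise.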

Speeding up the EDA process using flexible automation and consistent reporting allows analysts to deliver analyses quickly while ensuring precise, accurate results.

 

Mike Horoho is a Senior Director and Anthony Kelly is a Senior Consultant, both with the Forensic & Litigation Consulting segment of FTI Consulting, Inc. The views expressed herein are those of the author(s) and not necessarily the views of FTI Consulting, Inc., its management, its subsidiaries, its affiliates, or its other professionals. FTI Consulting, Inc., including its subsidiaries and affiliates, is a consulting firm and is not a certified public accounting firm or a law firm.