Federal and state courts have weighed in and generally concur that e-discovery must be proportional to the matter. The question remains, what does that actually mean? With data volumes exponentially increasing in scope due to the ever-increasing use of email, texting and social media, how do lawyers create proportional, accurate responses to e-discovery requests? We delved into a side by side analysis of a discovery request utilizing two different workflow methodologies to illustrate the vast difference technology and process can have on the cost of a production. We analyzed two scenarios with the same data set. These produced starkly different economic results.

Scenario “A” followed a more traditional approach to leveraging e-discovery technology and consisted of a single matter and a full collection of custodian inboxes for 10 custodians. Prior to the collection effort, the firm negotiated key terms and date ranges with opposing counsel. The data was processed and any documents that hit on the agreed searches were promoted to a review platform. A single service provider was selected to provide the data processing, hosting, litigation support, and a team of contract attorneys to assist with the first-pass review efforts related to meeting the initial production obligations.

Scenario “B” was similar in virtually every way, however, all collected data was processed and loaded into a review platform to allow for early analysis and keyword searches to be tested prior to negotiating with opposing counsel. The same service provider was selected; however, a slightly different pricing model was applied due to the change in approach. In this scenario, because the firm elected to host all data that was processed, a flat fee of $25 per GB, monthly, was offered that incorporated all data processing, hosting and analytics fees.

With each scenario we answered the following questions:

  • How long did each phase of the discovery process take? What were the total technology costs? How many hours were billed by the various disciplines necessary?
  • Separately, what pitfalls were encountered due to midstream scope changes? How did these impact turnaround time and total project costs due to the additional work?

For each scenario, the tables below step through the daily discovery lifecycle with a midstream change in scope that required the team to perform additional searches to identify documents previously not considered because they were believed to be nonresponsive. In the first example, this pivot added several days to the overall effort and resulted in a full production being completed in approximately three months. In the second example, the same change in scope did not have nearly the same impact and production was completed inside of two months.

Section A

Section B

The major difference between the two approaches is the decision to promote all data into the review environment in the second example, versus only processing and hosting data that hit on search terms in the first example. In the first scenario, when the case team was required to “go back to the well,” to perform additional searches it required the service provider to process data previously left on the cutting room floor and promote it to the review environment. Prior to promoting the documents for an additional wave of review, any documents that had previously hit on the original set of search terms needed to be parsed from the population. Furthermore, because the search results could not be analyzed until the documents were published to the review environment, the volume of documents that were presented to reviewers was a bit higher because the case team chose to apply searches more broadly as a defense against future scope changes.

The second approach saw the case team spending nearly five days working within the review environment testing searches and leveraging analytics technology to increase their precision and reduce the overall number of documents presented to reviewers. They knew that if they needed to add documents to the review set, it would require a simple matter of identifying the documents and batching them out. In the first scenario, roughly 240,000 documents were reviewed, whereas in the second only 200,000 were reviewed.

The second scenario also benefited from the use of analytics during the review process. By clustering the documents into smaller batches that contained conceptually similar content and grouping all emails within a conversation thread into the same batch, the pace at which the records were reviewed increased by ten documents per hour.

Ultimately the costs for the technology were slightly lower in the first scenario; a difference of nearly $5000 in the aggregate. Had there not been a change midstream that required additional searches, the pricing model used in the first example would have been closer to $10,000 less. However, the decision by the case team to promote all data to the review environment to allow for the use of analytics technology to assist with the culling and review efforts drastically reduced the number of billable hours that were needed to garner the same end result. The slightly lower document count combined with the increase in review pace saved over $100,000 and was completed in less time.

In the early 2000s, when discovery began to see the incorporation of electronic documents in addition to paper, the technology of the day forced case teams to take the same approach that is represented in the first scenario. This methodology presents limited options for applying more precise culling strategies without incurring  a cost-prohibitive bill tied to hosting data. Unfortunately, for various reasons, even though the tools available have matured, the approach towards e-discovery has changed little over the years. Many still view the technology costs associated with discovery in a vacuum and seldom consider the extra expenditure in billable hours that will result from making certain workflow decisions early in the process. More importantly, the assumption that midstream changes will not occur during the fact-finding process has proven to be a risky proposition. While no two matters are identical, virtually all data mining and discovery efforts reveal new information that ultimately influences the case strategy going forward.

In summary, the technology available with today's advanced tools requires a learning curve to properly navigate, and appears more costly without an analysis of review hours. However, when technology assisted review is properly utilized, the time required to review the data decreases, and the overall budget is less expensive.

Dan Beaty is a former practicing attorney who switched gears five years ago and moved to Special Counsel to serve as their director of strategic solutions. He consults with corporate legal departments and law firms to harness technology and project management processes to maximize the effectiveness of legal spend. 

Lisa Goldstein, as the business development manager of litigation at Buchanan Ingersoll & Rooney, Goldstein assists the firm's litigation lawyers with servicing existing clients and expanding their focus across core industries in the life science, energy, financial service and health care industries.

Kristopher Wasserman is vice president and senior consultant at Special Counsel. He oversees a team of solutions directors that provide technical expertise and guidance to corporate legal departments and law firms to develop defensible cost-effective solutions that involve managing and reviewing data that may be used as evidence.