5 Questions with Mike DeCesaris: AI/ML Efficiency Driven by GPUs


5 Questions is a periodic feature produced by Cornerstone Research, which asks our professionals, senior advisors, or affiliated experts to answer five questions.

We interview Mike DeCesaris, vice president of Cornerstone Research’s Data Science Center, about the benefits of working with GPUs, and how they enhance artificial intelligence (AI) and machine learning (ML) techniques.

What are GPUs?

Graphics processing units (GPUs) are specialized processors that, as the name suggests, were originally designed decades ago to perform efficiently the operations common to image and video processing. Those operations rely heavily on matrix-based mathematical calculations. People are generally more familiar with central processing units (CPUs), which are found in laptops, phones, and smart devices and can perform many different types of operations.

In the early 2000s, researchers realized that, because machine learning algorithms often involve the same types of calculations as graphics processing algorithms, GPUs could provide a more efficient alternative to CPU-based computation for machine learning. Despite availability and cost constraints relative to CPUs in recent years, GPU-based computation has become the de facto standard for machine learning and neural network training.

What are the benefits of using GPUs?

The key benefit is efficiency. The computing efficiency that GPUs provide does more than streamline the analytical process. It facilitates more extensive model training for greater accuracy, expands the scope of the model search process to test alternative specifications, makes feasible models that were previously impractical, and allows for additional sensitivities on alternative datasets to ensure robustness.

How do GPUs support expert testimony?

AI-based systems replace human decisions with data-driven ones, which can reduce subjectivity and error when processing large volumes of complex information. We use AI and ML to automate increasingly complex tasks and to unlock new approaches to analysis, including both supervised and unsupervised learning. These techniques are supported by our in-house GPUs.

How does the Data Science Center leverage GPU computing?

We use GPUs at all stages of the case lifecycle, from discovery to economic analysis, and for all types of data, from standard tabular data to text and images. Some of these applications rely on techniques where GPU computing has become popular, like neural networks, while others rely on more customized analytical frameworks. Some examples follow.

Matrix arithmetic

GPUs enable us to perform custom matrix arithmetic at high speed. For example, in antitrust matters, we often need to calculate the distances between all supplier and all consumer locations (coordinate pairs). Migrating this computation from CPUs to GPUs enables us to calculate distances between nearly 100 million coordinate pairs per second.
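As a rough illustration, the sketch below computes all pairwise supplier–consumer distances on a GPU with PyTorch. The array sizes and coordinates are placeholders, and torch.cdist stands in for whichever distance routine a given matter actually requires.

```python
# Minimal sketch (not a matter-specific pipeline): pairwise distances between
# supplier and consumer coordinates, computed on a GPU with PyTorch.
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

# Hypothetical inputs: projected (x, y) coordinates.
suppliers = torch.rand(10_000, 2, device=device)   # 10,000 supplier locations
consumers = torch.rand(50_000, 2, device=device)   # 50,000 consumer locations

# torch.cdist computes all pairwise Euclidean distances in one batched call,
# producing a 10,000 x 50,000 distance matrix that lives on the GPU.
distances = torch.cdist(suppliers, consumers)

# Example downstream use: the nearest supplier for each consumer.
nearest_supplier = distances.argmin(dim=0)
```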

Deep neural networks

Much of the excitement surrounding GPU-based computation focuses on neural networks. Neural networks can handle routine classification and regression problems, and task-specific architectures provide a framework for specialized analyses of text, images, and sound. Given the complexity of these models and the volume of data required to generate reliable results, their use is effectively infeasible without GPU computing resources. When training a popular multi-class image model on a GPU, we saw a 25,000% speedup compared to running the same process on a single CPU. We leverage this efficiency in content analyses for consumer fraud matters, where we design text and image classifiers to characterize the intended audience of at-issue marketing materials.
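For illustration, a minimal GPU training loop for a stock multi-class image model is sketched below. The architecture, class count, and data loader are assumptions rather than the specification used in any particular matter; the GPU-specific steps amount to moving the model and each batch of data onto the device.

```python
# Illustrative sketch: training an off-the-shelf image classifier on a GPU
# with PyTorch. The model choice and number of classes are hypothetical.
import torch
import torch.nn as nn
from torchvision import models

device = "cuda" if torch.cuda.is_available() else "cpu"

# A stock architecture with the output layer sized to a hypothetical
# number of audience categories.
model = models.resnet18(num_classes=5).to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

def train_epoch(loader):
    """Run one epoch; `loader` is any DataLoader yielding (images, labels)."""
    model.train()
    for images, labels in loader:
        # The only GPU-specific step: ship each batch to the device.
        images, labels = images.to(device), labels.to(device)
        optimizer.zero_grad()
        loss = loss_fn(model(images), labels)
        loss.backward()
        optimizer.step()
```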

Boosted trees

As GPU computing has become more widespread, popular machine learning software packages have increasingly included GPU-based computation options in their offerings. We often use boosted trees in regression and classification problems. These models sequentially aggregate many simple decision trees into a larger, more accurate learner. Compared to deep neural networks, which may feature hundreds of millions of parameters, these models are smaller and thus require less data and training time to produce generalizable inferences. These advantages make them more useful than deep neural networks for many of the analyses we regularly encounter. Switching to GPU-based training enables us to train models for these tasks nearly 100 times faster than the corresponding CPU specification.
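A minimal sketch of GPU-accelerated boosted trees appears below, using XGBoost on synthetic data. The parameter values are illustrative, and the device and tree_method arguments follow XGBoost 2.x, where GPU training is selected with device="cuda" (earlier releases exposed it through tree_method="gpu_hist").

```python
# Hedged example: gradient-boosted trees trained on a GPU with XGBoost,
# using synthetic data in place of any matter-specific dataset.
import numpy as np
from xgboost import XGBClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(100_000, 50))                       # synthetic features
y = (X[:, 0] + rng.normal(size=100_000) > 0).astype(int)  # synthetic labels

model = XGBClassifier(
    n_estimators=500,      # number of sequentially added trees
    max_depth=6,           # each individual tree stays small
    tree_method="hist",
    device="cuda",         # run the histogram method on the GPU
)
model.fit(X, y)
```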

Language models

Language models, often based on one or more deep learning techniques, can classify, parse, and generate text. We employ large language models to extract specific pieces of information, parse relationships between entities, identify semantic relationships, and supplement traditional term-based features in text classification problems, such as quantifying social media sentiment surrounding a public entity in defamation matters.

Unsurprisingly, given all that these models can do, processing documents through them on CPUs can introduce significant delays to the analytical process. With just a single GPU, we can segment documents into individual components and fully process several hundred sentences per second.
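As a hedged illustration, the snippet below scores sentence-level sentiment on a GPU with the Hugging Face transformers pipeline. The model name, task, and example sentences are placeholders, not those used in any particular engagement; documents would first be segmented into sentences upstream of this step.

```python
# Illustrative sketch: batched sentence-level sentiment scoring on a GPU
# using the transformers pipeline API. Model and inputs are placeholders.
from transformers import pipeline

classifier = pipeline(
    "sentiment-analysis",
    model="distilbert-base-uncased-finetuned-sst-2-english",
    device=0,   # 0 = first GPU; -1 would fall back to CPU
)

# A toy list stands in for sentences segmented out of case documents.
sentences = [
    "The company responded quickly to the reports.",
    "Customers described the product as misleading.",
]

# Passing a list lets the pipeline batch sentences through the GPU.
results = classifier(sentences, batch_size=32)
for sentence, result in zip(sentences, results):
    print(result["label"], round(result["score"], 3), "-", sentence)
```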

What developments can we expect in this space in the future?

GPUs and GPU-related software will continue to evolve. New hardware may feature more cores, faster cores, and more memory to accommodate larger models and data batches. New software may make it even easier to share models and data across multiple GPUs.

Other developments may involve different devices altogether. To address some of the inefficiencies still present in GPU computing, machine learning practitioners have increasingly turned to application-specific integrated circuits (ASICs) and field-programmable gate arrays (FPGAs). For example, Google's tensor processing unit (TPU) is an ASIC designed specifically to perform calculations for its TensorFlow machine learning software package. FPGAs offer more flexibility and are typically used to deploy machine learning models in production environments that require low latency, high bandwidth, and minimal energy consumption.

We monitor developments in this space to ensure that we continue to provide best-in-class service to our clients and experts.

Interviewee

Mike DeCesaris
Vice President, Data Science Center
San Francisco