About

The U.S. Supreme Court Database (SCDB) is a comprehensive, public, multi-user data resource containing information about every case decided by the United States Supreme Court from its first decision in 1791 to today. The Database forms the basis of nearly every empirical study of U.S. Supreme Court decisionmaking in the past forty years and is arguably the most important single data source in empirical law and social science. Virtually all systematic analyses of the contemporary Supreme Court and its members have relied on it. This holds for research conducted by social scientists and their graduate students and, increasingly, by legal academics; and it holds for quantitative and qualitative studies, as well as those more descriptive in nature. In fact, several inventories of peer-reviewed journals show that it is the rare article on the Court that derives its data from an alternative source. Monographs published by top presses also regularly rely on the Database, and the many numerical studies of the Court receiving public attention in recent years have made liberal use of the data it houses. By the same token, journalists seeking to illuminate dimensions of the Court’s work regularly deploy Spaeth’s product; indeed, Linda Greenhouse, the Pulitzer-prize winning reporter, once referred to it as “a computerized treasure trove…created under a grant from the National Science Foundation,” and has cited it (or research relying on it) in her writings.

The U.S. Supreme Court Database traces its history to more than three decades ago, when Harold J. Spaeth asked the National Science Foundation to fund a database that would be so rich in content that multiple users—even those with vastly distinct projects and purposes in mind—could draw on it. Professor Spaeth’s goal was at once refreshingly simple and extremely ambitious: to produce a database that would include and classify every single vote by a Supreme Court justice in all argued cases over a five-decade period. After securing the funding, Spaeth collected and coded the data, performed reliability checks, and eventually amassed the Database. In the late 1980s, he made it (and the documentation necessary to use it) publicly available.

Then, decades later, a team of researchers led by Professors Lee Epstein and Andrew D. Martin at Washington University in St. Louis received additional funding to modernize the database and to backdate it to the Court’s first decision in 1791. Today’s version of the Database houses 247 pieces of information for each case, roughly broken down into six categories:

Identification variables (e.g., citations and docket numbers)
Background variables (e.g., how the Court took jurisdiction, origin and source of the case, the reason the Court agreed to decide it)
Chronological variables (e.g., the date of decision, term of Court, natural court)
Substantive variables (e.g., legal provisions, issues, direction of decision)
Outcome variables (e.g., disposition of the case, winning party, formal alteration of precedent, declaration of unconstitutionality)
Voting and opinion variables (e.g., how the individual justices voted, their opinions and interagreements)

In 2024, the database moved from Washington University in St. Louis to Penn State where it now has its home in the Initiative for Legal Institutions and Democracy in the McCourtney Institute for Democracy. Professor Spaeth’s dedication to democratizing access to reliable and valid data on the U.S. Supreme Court continues for a new generation of students, scholars, journalists, and the public.

The U.S. Supreme Court Database (SCDB) is a comprehensive, public, multi-user data resource containing information about every case decided by the United States Supreme Court from its first decision in 1791 to today. The Database is arguably the most important single data source in empirical law and social science. Virtually all systematic analyses of the contemporary Supreme Court and its members have relied on it. This holds for research conducted by social scientists and their graduate students and, increasingly, by legal academics; and it holds for quantitative and qualitative studies, as well as those more descriptive in nature. In fact, several inventories of peer-reviewed journals show that it is the rare article on the Court that derives its data from an alternative source. Monographs published by top presses also regularly rely on the Database, and the many numerical studies of the Court receiving public attention in recent years have made liberal use of the data it houses. By the same token, journalists seeking to illuminate dimensions of the Court’s work regularly rely on the Database; indeed, Linda Greenhouse, the Pulitzer-prize winning reporter, once referred to it as “a computerized treasure trove…created under a grant from the National Science Foundation,” and has cited it (or research relying on it) in her writings.

In short, the U.S. Supreme Court Database has not just helped fill gaps in our knowledge. It is one of those rare creatures in the law and social science world: an invention that has substantially advanced a large area of study, inspiring research by scholars hailing from no fewer than three and as many as seven disciplines.