File(s) stored somewhere else
Please note: Linked content is NOT stored on The United States Environmental Protection Agency’s Center for Computational Toxicology and Exposure and we can't guarantee its availability, quality, security or accept any liability.
Chemical and Product Categories (CPCat) database
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
The EPA CPCat (Chemical and Product Categories) database was released in May 2014. It maps >43,000 chemicals to a set of terms categorizing their usage or function. We have compiled a comprehensive list of chemicals with associated categories of chemical and product use by compiling publicly available sources. Sources include, but are not limited to: the Substances in Preparation in Nordic Countries (SPIN) database, information provided by companies, trade associations, and regulatory agencies such as the U.S. Environmental Protection Agency (EPA) and Food and Drug Administration (FDA), the DrugBank database of pharmaceutical products, and information mined from the Aggregated Computational Toxicology Resource (ACToR) database developed by the U.S. EPA. Unique use category taxonomies from each source are mapped onto a single common set of ~800 terms. The user can search for chemicals by chemical name, Chemical Abstracts Registry Number (CASRN), or by CPCat terms (i.e. category names) associated with chemicals. See Dionisio et al., 2014 for a full description of the database, sources used, interpretation of chemical categories, and potential applications. The .zip file available at the "Download" tab of this website provides a full copy of the database, available for free download, which can be freely searched and sorted for data analysis. The .zip file includes a list of all chemicals included in CPCat. A list of all sources included in CPCat is provided in the table below.