Skip to main content
U.S. flag

An official website of the United States government

Here’s how you know

Dot gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

HTTPS

Secure .gov websites use HTTPS
A lock ( Lock A locked padlock ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

  • Environmental Topics
  • Laws & Regulations
  • Report a Violation
  • About EPA
Risk Assessment
Contact Us

Enhancing Evidence Interpretation and Database Integration Via Semantic Matching

On this page:

  • Overview
As part of implementing systematic review, the US Environmental Protection Agency’s (EPA) Integrated Risk Information System (IRIS) program extracts data from ~150 studies per year across 15-20 chemical assessments that are in the development phase. These data are stored in the Health Assessment and Workspace Collaborative (HAWC, https://hawcprd.epa.gov/about/ ) a free, open-source, and web-based application. Data extraction of author reported health findings have introduced a data consistency and semantic challenge because terms reported by authors are inconsistent (e.g., cytotoxicity, cell death; programmed cell death; and cell viability). Inconsistent language may lead to duplication and/or misinterpretation of study findings, make it difficult to efficiently retrieve information from HAWC, and pose a significant barrier to data exchange across different databases used to store toxicity findings.

To address these data inconsistencies, the author reported terms managed within EPA HAWC were matched to ontologies and ontology classes within Bioportal (https://bioportal.bioontology.org/ (a comprehensive repository of medical ontologies) to create a controlled vocabulary and ontology useful for expressing relationships between terms. The results (between the input [author term] and Bioportal ontology classes) were scored as: 1 = perfect match, 0.5 = synonym, and other values (0–1) for partial matches. The matching process returns other parameters (e.g. ontology, preferred name, synonym, class definition, class parent, parent definitions) that were used along with the numerical score to annotate author terms into a HAWC controlled vocabulary. The controlled vocabulary is critically important to unify study data managed by the HAWC database, whereas ontologies are used to query the database for relationships between those terms. The result is increased transparency and consistency in identifying and retrieving pertinent evidence during evidence synthesis. The EPA HAWC vocabulary and ontology are interoperable with other databases such as the Adverse Outcome Pathway (AOP) knowledge base and by class matching and ontology mapping can be integrated and used for advanced querying of potential relationships between exposure and outcome. The views expressed in this abstract are those of the authors and do not necessarily reflect the views or policies of the U.S. EPA.

Impact/Purpose

Leveraging SR methodologies with transparent and consistent data management practices as evidence addressing key science questions requires accurate curation of author reported data and integration with other databases. Semantic ontology concept matching can translate disparate author reported terms to: (1) preserve data in its original context; (2) promote consistency within the HAWC database; (3) make information more findable by facilitating the process of evidence integration; and (4) serve as a point of data integration with other databases of toxicity findings.

Citation

Angrish, M., S. Watford, G. Hodge, G. Woodall, AND A. Mudambi. Enhancing Evidence Interpretation and Database Integration Via Semantic Matching. NAS Workshop on Systematic Review of Mechanistic Data, Washington, D.C, December 10 - 11, 2018.
  • Risk Assessment Home
  • About Risk Assessment
  • Risk Recent Additions
  • Human Health Risk Assessment
  • Ecological Risk Assessment
  • Risk Advanced Search
    • Risk Publications
  • Risk Assessment Guidance
  • Risk Tools and Databases
  • Superfund Risk Assessment
  • Where you live
Contact Us to ask a question, provide feedback, or report a problem.
Last updated on May 26, 2021
United States Environmental Protection Agency

Discover.

  • Accessibility Statement
  • Budget & Performance
  • Contracting
  • EPA www Web Snapshots
  • Grants
  • No FEAR Act Data
  • Privacy
  • Privacy and Security Notice

Connect.

  • Data
  • Inspector General
  • Jobs
  • Newsroom
  • Open Government
  • Regulations.gov
  • Subscribe
  • USA.gov
  • White House

Ask.

  • Contact EPA
  • EPA Disclaimers
  • Hotlines
  • FOIA Requests
  • Frequent Questions

Follow.