Skip to main content
U.S. flag

An official website of the United States government

Here’s how you know

Dot gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

HTTPS

Secure .gov websites use HTTPS
A lock ( Lock A locked padlock ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

  • Environmental Topics
  • Laws & Regulations
  • Report a Violation
  • About EPA
Risk Assessment
Contact Us

Data Annotation and Migration Across Systematic Review Tools

On this page:

  • Overview
Automatic update of the information supporting an assessment is limited by manual finding, uploading, and migration of information between various systematic review tools. This is a particular challenge in the development of chemical assessments where a rapid compilation of new information and/or update to existing information is needed in a pre-decisional, regulatory context. Artificial intelligence was previously used to rapidly screen >40K studies for ~150 perfluoroalkyl substances and manually extracted data were summarized in systematic evidence maps (SEMs). Keeping these influential SEMs updated with the latest relevant research is time-consuming and labor-intensive. Our goal is to use machine learning models to reduce this ongoing effort. However, creating such models first requires detailed, machine-readable annotation of datasets to label entities of interest. The structured data extraction templates supporting DistillerSR for the PFAS SEM were used to create entities (labels for text) that were manually annotated from the titles, abstracts, methods, and results sections from 67 animal toxicology PDFs using the FIDDLE Extraction Workbench. A total of >24K annotations were generated from 12 entities across the corpus. Pilots targeting the migration of annotated data outputs revealed the needs that are the focus of current work. These include  mapping  annotated terms that do not follow convention (e.g. linking to controlled vocabularies and ontologies),piloting annotation tool grouping and relating annotations according to predefined schema, and input/output formats that support interoperability between tools. These views do not necessarily reflect those of the US EPA.

Impact/Purpose

Automatic update of the information supporting an assessment is limited by manual finding, uploading, and migration of information between various systematic review tools. This is a particular challenge in the development of chemical assessments where a rapid compilation of new information and/or update to existing information is needed in a pre-decisional, regulatory context. Keeping these influential SEMs updated with the latest relevant research is time-consuming and labor-intensive. Our goal is to use machine learning models to reduce this ongoing effort.

Citation

Angrish, M. Data Annotation and Migration Across Systematic Review Tools. 10th Annual Meeting of the ASCCT, virtual, NA -Virtual, October 12 - 14, 2021.
  • Risk Assessment Home
  • About Risk Assessment
  • Risk Recent Additions
  • Human Health Risk Assessment
  • Ecological Risk Assessment
  • Risk Advanced Search
    • Risk Publications
  • Risk Assessment Guidance
  • Risk Tools and Databases
  • Superfund Risk Assessment
  • Where you live
Contact Us to ask a question, provide feedback, or report a problem.
Last updated on May 27, 2022
United States Environmental Protection Agency

Discover.

  • Accessibility Statement
  • Budget & Performance
  • Contracting
  • EPA www Web Snapshots
  • Grants
  • No FEAR Act Data
  • Privacy
  • Privacy and Security Notice

Connect.

  • Data
  • Inspector General
  • Jobs
  • Newsroom
  • Open Government
  • Regulations.gov
  • Subscribe
  • USA.gov
  • White House

Ask.

  • Contact EPA
  • EPA Disclaimers
  • Hotlines
  • FOIA Requests
  • Frequent Questions

Follow.