Skip to main content
U.S. flag

An official website of the United States government

Here’s how you know

Dot gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

HTTPS

Secure .gov websites use HTTPS
A lock ( Lock A locked padlock ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

  • Environmental Topics
  • Laws & Regulations
  • Report a Violation
  • About EPA
Risk Assessment
Contact Us

Informing environmental health policy on complex mixtures - what we need vs. what we currently get from machine learning methods:development of semi-automated data extraction models for human health assessments.

On this page:

  • Overview
EPA is working to streamline the human effort required to identify and extract salient data from environmental epidemiology studies and summarize information on study design and results. These streamlining efforts are being achieved by adopting automated and semiautomated processes for data extraction such as Named Entity Recognition (NER) and various machine learning techniques. This presentation discusses an approach for developing extraction algorithms which overviews the use of extraction software tools to annotate studies to create the datasets used to train the extraction algorithms (to identify  epidemiologic data of interest), the issues faced in that process and the next steps for developing the data extraction model.

Impact/Purpose

The growing body of epidemiolgical literature presents opportunities for integrating the knowledge gained through human health assessment research that is used to inform policy. This presentation provides a brief overview of the the intitial process and methods taken to develop a data extraction model which utilizes machine learning applications to facilitate development of human health assessments and considerations for integrating such data into public health policy.

Citation

Wilkins, A., K. Christensen, E. Radke-Farabaugh, AND B. Howard. Informing environmental health policy on complex mixtures - what we need vs. what we currently get from machine learning methods:development of semi-automated data extraction models for human health assessments. International Society of Exposure Science (ISES), NA, August 30 - September 02, 2021.
  • Risk Assessment Home
  • About Risk Assessment
  • Risk Recent Additions
  • Human Health Risk Assessment
  • Ecological Risk Assessment
  • Risk Advanced Search
    • Risk Publications
  • Risk Assessment Guidance
  • Risk Tools and Databases
  • Superfund Risk Assessment
  • Where you live
Contact Us to ask a question, provide feedback, or report a problem.
Last updated on September 13, 2021
United States Environmental Protection Agency

Discover.

  • Accessibility Statement
  • Budget & Performance
  • Contracting
  • EPA www Web Snapshots
  • Grants
  • No FEAR Act Data
  • Privacy
  • Privacy and Security Notice

Connect.

  • Data
  • Inspector General
  • Jobs
  • Newsroom
  • Open Government
  • Regulations.gov
  • Subscribe
  • USA.gov
  • White House

Ask.

  • Contact EPA
  • EPA Disclaimers
  • Hotlines
  • FOIA Requests
  • Frequent Questions

Follow.