mcs07/ChemDataExtractor

Latest commit: 74ab31d · Aug 17, 2017

ChemDataExtractor

ChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.

Features

  • HTML, XML and PDF document readers
  • Chemistry-aware natural language processing pipeline
  • Chemical named entity recognition
  • Rule-based parsing grammars for property and spectra extraction
  • Table parser for extracting tabulated data
  • Document processing to resolve data interdependencies
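As a rough illustration of how these features fit together, the sketch below passes a piece of text through the library and collects the compound names it finds. It assumes the package is installed and that its data files have been downloaded (for example with `cde data download`). The helper names `extract_records` and `compound_names` are hypothetical and not part of ChemDataExtractor, and the `"names"` key is an assumption about the shape of the serialized records.

```python
def extract_records(text):
    """Return the chemical records found in `text` as plain dictionaries.

    Hypothetical helper: needs ChemDataExtractor and its data files installed.
    """
    # Imported lazily so this module still loads where the package is absent.
    from chemdataextractor import Document

    doc = Document(text)             # a plain string is treated as a paragraph
    return doc.records.serialize()   # list of dicts, one per compound record


def compound_names(records):
    """Collect unique compound names from serialized records, in order.

    Assumes each record may carry a "names" list; records without one are skipped.
    """
    names = []
    for record in records:
        for name in record.get("names", []):
            if name not in names:
                names.append(name)
    return names
```

Calling `compound_names(extract_records(text))` on a paragraph taken from a paper should give a list of the compound names mentioned in it. The exact records returned depend on the installed version and its models.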

Installation

To install ChemDataExtractor, simply run:

pip install chemdataextractor

Or if you are an Anaconda user, run:

conda install -c chemdataextractor chemdataextractor

Alternatively, try one of the other installation options.

Documentation

Full documentation is available at http://chemdataextractor.org/docs

License

ChemDataExtractor is licensed under the MIT license, a permissive, business-friendly license for open source software.