pyConTextNLP

Python ConText implementation for NLP

What is pyConTextNLP?

pyConTextNLP is a partial implementation of the ConText algorithm using Python. The original description of pyConTextNLP was provided in Chapman BE, Lee S, Kang HP, Chapman WW, "Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm." J Biomed Inform. 2011 Oct;44(5):728-37

Since that publication pyConTextNLP has undergone several important revisions:

  1. Incorporating NetworkX to describe target/modifier relationships.
  2. Porting from Python 2.x to Python 3.x
    • This is a work in progress. pyConTextNLP does not have a clean transition for handling unicode in Python 2.x in my attempts to port to 3.x
  3. Rewriting pyConTextNLP to have a more functional style.
    • This has been motivated by both the need to incorporate parallel processing into the algorithm for speed and to reduce unintended side effects.
    • This work currently lies in the subpackage functional.

Dependencies

  • NetworkX for relating ConText relationships.
  • TextBlob for sentence splitting.
  • nose for unit testing.

Installation

pyConTextNLP is hosted on GitHub and is index in pypi so can be installed with pip:

pip install pyConTextNLP

Bibliography