
NERBio


Demo systems

Resources

Related Annotation Tools

  • NLProt: a tool for finding protein names in natural-language text. NLProt is based on Support Vector Machines (SVMs) trained on contextual features of named entities (NEs) in scientific language. Simple filtering rules and a protein-name dictionary are also used to improve performance. NLProt reached a precision (accuracy) of 70% at a recall (coverage) of 85% when run on the 166 abstracts of EMBL and Cell (Nov/Dec 2003).
  • ABNER: a software tool for molecular biology text analysis. At ABNER's core is a statistical machine learning system using linear-chain conditional random fields (CRFs) with a variety of orthographic and contextual features. ABNER 1.5 includes two models trained on the NLPBA and BioCreative corpora, for which performance is roughly state of the art (F1 scores of 70.5 and 69.9 respectively).
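The two entries above report different metrics: precision and recall for NLProt, F1 for ABNER. As a rough aid for reading them side by side, the sketch below computes the standard F1 score (the harmonic mean of precision and recall) from the NLProt figures quoted above. The function name `f1_score` is chosen here for illustration and is not part of either tool. Because the tools were evaluated on different corpora, the resulting number is not a direct comparison.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (both in [0, 1])."""
    if precision + recall == 0:
        # Avoid division by zero when both values are zero.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as quoted for NLProt in the list above.
nlprot_f1 = f1_score(0.70, 0.85)
print(f"NLProt F1 (derived from the quoted figures): {nlprot_f1 * 100:.1f}")
```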

Corpus
