Skip to content
@trec-kba

TREC KBA & StreamCorpus

common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text

Popular repositories Loading

  1. streamcorpus streamcorpus Public

    common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text

    Scala 35 19

  2. many-stop-words many-stop-words Public

    stop word lists in several languages

    Python 21 20

  3. streamcorpus-pipeline streamcorpus-pipeline Public

    framework for making streamcorpus data

    HTML 11 4

  4. kba-tools kba-tools Public

    Tools for working with TREC KBA entities, training data, and run submissions

    Python 5 2

  5. kba-corpus kba-corpus Public

    Tools for working with TREC KBA Corpora

    Python 4 4

  6. kba-stanford-corenlp kba-stanford-corenlp Public

    Wrappers for generating one-word-per-line output representing all the goodies from Stanford CoreNLP, so we can include it in the KBA stream corpus.

    Java 4

Repositories

Showing 10 of 14 repositories

Top languages

Loading…

Most used topics

Loading…