Skip to content

gazzola/projects

ย 
ย 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

19 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Example projects

This repo contains example projects for various NLP tasks, including scripts, benchmarks, results and datasets created with Prodigy.

๐Ÿ’ Projects

Name Description
ner-food-ingredients Use sense2vec and Prodigy to bootstrap an NER model to detect ingredients in Reddit comments and to calculate how these mentions change over time. Includes an end-to-end video tutorial, raw pre-processed data, 949 annotated examples and pretrained tok2vec weights.
ner-fashion-brands Use sense2vec to bootstrap an NER model to detect fashion brands in Reddit comments. Includes 1735 annotated examples, a data visualizer, training and evaluation scripts for spaCy and pretrained tok2vec weights.
ner-drugs Use word vectors to bootstrap an NER model to detect drug names in Reddit comments. Includes 1977 annotated examples, a data visualizer, training and evaluation scripts for spaCy and pretrained tok2vec weights.
textcat-docs-issues Train a binary text classifier with exclusive classes to predict whether a GitHub issue title is about documentation. Includes 1161 annotated examples, a live demo and downloadable model and training and evaluation scripts for spaCy.

About

๐Ÿ’ Example projects for various NLP tasks with datasets, scripts and results

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 83.2%
  • Python 16.8%