Data annotation notes
✎ This article/section is a stub — some half-sorted notes, not necessarily checked, not necessarily correct. Feel free to ignore, or tell me about it.
Tools
Online, open source
Label Studio
- image, text, audio, video, time series
- browser based (your own hosted copy)
- https://labelstud.io/
doccano
- text
- browser based (your own hosted copy)
- https://doccano.github.io/doccano/
ML-Annotate
- text
- browser based (your own hosted copy)
- https://github.com/falcony-io/ml-annotate
brat
- text
- browser based (your own hosted copy)
- http://brat.nlplab.org/
annotator.js
- text
- JavaScript library that you embed in webpages to make them annotatable (not a standalone app)
- http://annotatorjs.org/
Annotation Lab (a.k.a. NLP Lab)
- text (also images?)
- https://nlp.johnsnowlabs.com/docs/en/alab/quickstart
Paid and/or closed source
(mostly online or self-hosted)
datagym
- image and video
- web based
- assisted labeling
- free/paid model
- open source
- https://www.datagym.ai/
LightTag
- text
- free start, mostly paid
- https://www.lighttag.io/
Label Your Data
- https://labelyourdata.com/
- closed source
- paid
Prodigy
- text (including spaCy-specific tasks such as POS tagging, NER, and dependency parsing); images, audio
- paid only? (verify) [1]
- https://prodi.gy/
LabelBox
- free for small data, mostly paid
- https://labelbox.com/
CVAT
- image and video
- open source if you host it yourself; the hosted version at cvat.ai is paid, with a limited free tier
- https://cvat.ai/
GUI, open source
LabelImg
- image
- Python, Qt (local install)
- open source
- https://github.com/heartexlabs/labelImg
MAE (Multi-document Annotation Environment)
- text
- GUI (Java)
- open source
- https://keighrim.github.io/mae-annotation/
- https://github.com/keighrim/mae-annotation
YEDDA
- text
- GUI app
- open source
- https://github.com/jiesutd/YEDDA
ELAN
- audio and video
- open source
- https://archive.mpi.nl/tla/elan
Praat
- audio
- open source
- specialized for phonetics/linguistics
- https://www.fon.hum.uva.nl/praat/
Phon
- audio
- specialized for phonetics/linguistics
- open source
- https://www.phon.ca/phon-manual/getting_started.html
- https://github.com/phon-ca/phon
Unsorted
ipyannotations
- text, though mostly images
- Jupyter notebook widgets (Python)
poplar
VGG Image Annotator (VIA), from the Visual Geometry Group at Oxford University
- varied (image, audio, video)