Skip to content

Data for the paper "Machine Translation Between High-resource Languages in a Language Documentation Setting" (@FieldMatters2022)

Notifications You must be signed in to change notification settings

Kelina/MT4LanguageDocumentation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Machine Translation 4 Language Documentation

This repo contains the data for the paper "Machine Translation Between High-resource Languages in a Language Documentation Setting" (@FieldMatters2022).

Specifically, it contains English--Portuguese MT datasets that can be used to benchmark models for the translation of transcriptions of informal, colloquial speech. There are development and test files, training is supposed to happen on other datasets.

About

Data for the paper "Machine Translation Between High-resource Languages in a Language Documentation Setting" (@FieldMatters2022)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published