Apertium

Identify and add 250 new entries to a bilingual dictionary

Our translation systems require large lexicons to provide production-quality coverage of any input data. This task requires the student to add 250 new words to a bidirectional dictionary. Choose one of the language pairs listed below and, with the help of your mentor, identify some text in one of the two languages. Run the text through Apertium's translator for that language pair to identify 250 unknown forms. Where needed, also add the stems of these forms to the individual languages' analysers in an appropriate way, so that the words are analysed correctly. Your submission should take the form of a pull request to each of the appropriate repositories on GitHub.
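The search for unknown forms described above can be partly automated. What follows is a minimal Python sketch, not part of the official task instructions. It assumes the translator output has already been saved as text and relies on Apertium's default behaviour of marking unknown words with a leading asterisk. The function name `unknown_forms` and the decision to lowercase every form are illustrative assumptions; lowercasing may be wrong for proper nouns or for languages where case is meaningful.

```python
import re
from collections import Counter

# Apertium's default output prefixes each unknown word with "*".
# The pattern captures the word itself, allowing internal hyphens
# and apostrophes (e.g. "*widget-box").
UNKNOWN = re.compile(r"\*(\w+(?:[-'’]\w+)*)")


def unknown_forms(translated: str, limit: int = 250) -> list[tuple[str, int]]:
    """Return up to `limit` unknown forms with their counts, most frequent first.

    `translated` is the raw text produced by the translator.
    Forms are lowercased before counting (a simplifying assumption).
    """
    counts = Counter(form.lower() for form in UNKNOWN.findall(translated))
    return counts.most_common(limit)
```

One way to use it: save the translator output to a file, read the file into a string, and pass that string to `unknown_forms`. The most frequent forms are usually the most useful candidates to discuss with your mentor, since adding them improves coverage the most.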

The language pairs we can mentor for this task are the following: English-Basque, English-French, English-Portuguese, English-Galician, English-Spanish, English-Irish, Catalan-English, Catalan-Basque, Catalan-French, Catalan-Portuguese, Catalan-Galician, Catalan-Spanish, Catalan-Irish, Basque-French, Basque-Portuguese, Basque-Galician, Basque-Spanish, Basque-Irish, French-Portuguese, French-Galician, French-Spanish, French-Irish, Portuguese-Spanish, Galician-Portuguese, Galician-Spanish, Irish-Portuguese, Irish-Galician, Irish-Spanish, Breton-English, Breton-Catalan, Breton-Basque, Breton-French, Breton-Portuguese, Breton-Galician, Breton-Spanish, Breton-Irish, Guaraní-Spanish, Guaraní-Portuguese, Guaraní-Russian, Portuguese-Russian, Russian-Spanish, English-Turkish, English-Uyghur, English-Tatar, English-Kurmancî Kurdish, English-Iranian Persian, Turkish-Uyghur, Azerbaijani-English, Azerbaijani-Turkish, Azerbaijani-Uyghur, Azerbaijani-Tatar, Azerbaijani-Crimean Tatar, Azerbaijani-Kurmancî Kurdish, Azerbaijani-Soranî Kurdish, Azerbaijani-Iranian Persian, Tatar-Turkish, Tatar-Uyghur, Crimean Tatar-English, Crimean Tatar-Turkish, Crimean Tatar-Uyghur, Crimean Tatar-Tatar, Crimean Tatar-Kurmancî Kurdish, Crimean Tatar-Iranian Persian, Kurmancî Kurdish-Turkish, Kurmancî Kurdish-Uyghur, Kurmancî Kurdish-Tatar, Soranî Kurdish-English, Soranî Kurdish-Turkish, Soranî Kurdish-Uyghur, Soranî Kurdish-Tatar, Soranî Kurdish-Crimean Tatar, Soranî Kurdish-Kurmancî Kurdish, Soranî Kurdish-Iranian Persian, Iranian Persian-Turkish, Iranian Persian-Uyghur, Iranian Persian-Tatar, Iranian Persian-Kurmancî Kurdish, English-Norwegian, English-Russian, English-Swedish, English-Gagauz, Spanish-Swedish, Spanish-Turkish, Catalan-Norwegian, Catalan-Russian, Catalan-Swedish, Catalan-Turkish, Catalan-Gagauz, French-Norwegian, French-Russian, French-Swedish, French-Turkish, French-Gagauz, Norwegian-Spanish, Norwegian-Russian, Norwegian-Portuguese, Norwegian-Swedish, Norwegian-Turkish, 
Russian-Swedish, Russian-Turkish, Portuguese-Swedish, Portuguese-Turkish, Swedish-Turkish, Gagauz-Spanish, Gagauz-Norwegian, Gagauz-Russian, Gagauz-Portuguese, Gagauz-Swedish, Gagauz-Turkish, Azerbaijani-Spanish, Azerbaijani-Catalan, Azerbaijani-French, Azerbaijani-Norwegian, Azerbaijani-Russian, Azerbaijani-Portuguese, Azerbaijani-Swedish, Azerbaijani-Gagauz, English-Kazakh, English-Uzbek, English-Kumyk, Arabic-English, Arabic-Turkish, Arabic-Kazakh, Arabic-Azerbaijani, Arabic-Tatar, Arabic-Gagauz, Arabic-Uyghur, Arabic-Uzbek, Arabic-Crimean Tatar, Arabic-Kumyk, Turkish-Uzbek, Kazakh-Turkish, Kazakh-Tatar, Kazakh-Uyghur, Kazakh-Uzbek, Kazakh-Kumyk, Azerbaijani-Kazakh, Azerbaijani-Uzbek, Azerbaijani-Kumyk, Tatar-Uzbek, Gagauz-Kazakh, Gagauz-Tatar, Gagauz-Uyghur, Gagauz-Uzbek, Gagauz-Kumyk, Uyghur-Uzbek, Crimean Tatar-Kazakh, Crimean Tatar-Gagauz, Crimean Tatar-Uzbek, Crimean Tatar-Kumyk, Kumyk-Turkish, Kumyk-Tatar, Kumyk-Uyghur, Kumyk-Uzbek, Russian-Sakha, English-Sakha, Afrikaans-German, Afrikaans-English, German-English, English-Mandarin Chinese, Catalan-Moldovan, English-Moldovan, Moldovan-Spanish, English-Malayalam, English-Hindi, Hindi-Malayalam, English-Kyrgyz, English-Qaraqalpaq, English-Noghay, English-Halh Mongolian, English-Yiddish, Spanish-Tatar, Spanish-Uzbek, Spanish-Uyghur, Spanish-Yiddish, French-Kazakh, French-Kyrgyz, French-Qaraqalpaq, French-Tatar, French-Kumyk, French-Noghay, French-Uzbek, French-Uyghur, French-Halh Mongolian, French-Yiddish, Russian-Tatar, Russian-Uzbek, Russian-Uyghur, Russian-Yiddish, Turkish-Yiddish, Gagauz-Kyrgyz, Gagauz-Qaraqalpaq, Gagauz-Noghay, Gagauz-Halh Mongolian, Gagauz-Yiddish, Azerbaijani-Kyrgyz, Azerbaijani-Qaraqalpaq, Azerbaijani-Bashqort, Azerbaijani-Noghay, Azerbaijani-Halh Mongolian, Azerbaijani-Yiddish, Kazakh-Spanish, Kazakh-Russian, Kazakh-Kyrgyz, Kazakh-Noghay, Kazakh-Halh Mongolian, Kazakh-Yiddish, Kyrgyz-Spanish, Kyrgyz-Russian, Kyrgyz-Turkish, Kyrgyz-Tatar, Kyrgyz-Kumyk, Kyrgyz-Noghay, Kyrgyz-Uzbek, 
Kyrgyz-Uyghur, Kyrgyz-Yiddish, Qaraqalpaq-Spanish, Qaraqalpaq-Russian, Qaraqalpaq-Turkish, Qaraqalpaq-Kazakh, Qaraqalpaq-Kyrgyz, Qaraqalpaq-Tatar, Qaraqalpaq-Kumyk, Qaraqalpaq-Noghay, Qaraqalpaq-Uzbek, Qaraqalpaq-Uyghur, Qaraqalpaq-Halh Mongolian, Qaraqalpaq-Yiddish, Tatar-Yiddish, Bashqort-English, Bashqort-Spanish, Bashqort-French, Bashqort-Russian, Bashqort-Turkish, Bashqort-Gagauz, Bashqort-Kazakh, Bashqort-Kyrgyz, Bashqort-Qaraqalpaq, Bashqort-Tatar, Bashqort-Kumyk, Bashqort-Noghay, Bashqort-Uzbek, Bashqort-Uyghur, Bashqort-Crimean Tatar, Bashqort-Halh Mongolian, Bashqort-Yiddish, Kumyk-Spanish, Kumyk-Russian, Kumyk-Noghay, Kumyk-Yiddish, Noghay-Spanish, Noghay-Russian, Noghay-Turkish, Noghay-Tatar, Noghay-Uzbek, Noghay-Uyghur, Noghay-Yiddish, Uzbek-Yiddish, Uyghur-Yiddish, Crimean Tatar-Spanish, Crimean Tatar-French, Crimean Tatar-Russian, Crimean Tatar-Kyrgyz, Crimean Tatar-Qaraqalpaq, Crimean Tatar-Noghay, Crimean Tatar-Halh Mongolian, Crimean Tatar-Yiddish, Halh Mongolian-Spanish, Halh Mongolian-Russian, Halh Mongolian-Turkish, Halh Mongolian-Kyrgyz, Halh Mongolian-Tatar, Halh Mongolian-Kumyk, Halh Mongolian-Noghay, Halh Mongolian-Uzbek, Halh Mongolian-Uyghur, Halh Mongolian-Yiddish.

More instructions for this task here...

Task tags

  • xml
  • dictionaries

Students who completed this task

Maathavan, albertonl

Task type

  • Quality Assurance

2019