WAT aligns parallel texts using several alignment engines:
- JAM : Written in Perl. Can align more than two languages at the same time.
- Alinea Lite : Written in Prolog, a long time ago. Slow but robust.
- LF Aligner : a project led by Andras Farkas, based on Daniel Varga's Hunalign.
- YASA : Originally created by Philippe Langlais, then reimplemented by Alexandre Patry. It was reborn as yasa as part of improvements made during Fethi Lamraoui's master's thesis.
References
Kraif O. (2001) Exploitation des cognats dans les systèmes d’alignement bi-textuel : architecture et évaluation, TAL 42 :3, ATALA, Paris, pp. 833-867.
(pdf)
Kraif O. (2015) Multi-alignement vs bi-alignement : à plusieurs, c’est mieux !, Actes de TALN 2015, 22ème Conférence sur le Traitement Automatique des Langues Naturelles, Caen, 22-25 juin 2015, pp. 255-266.
(pdf)
Lamraoui, F., and P. Langlais (2013) "Yet Another Fast, Robust and Open Source Sentence Aligner. Time to Reconsider Sentence Alignment?", XIV Machine Translation Summit, Nice, France.
(pdf)
Varga, D., Németh, L., Halácsy, P., Kornai, A., Trón, V., Nagy, V. (2005). Parallel corpora for medium density languages. In Proceedings of the RANLP 2005, pages 590-596.
(pdf)
Web design
Built with the Bootstrap CSS framework and icons from Glyphicons.
File Upload Widget imported from jQuery-File-Upload.
File Tree Widget imported from jQueryFileTree.
Language identification
Automatic language identification uses the guessLanguage.js library.