About
Inforex is a web system for text corpora construction. Inforex allows parallel access and sharing resources among many users. The system assists semantic annotation of texts on several levels, such as marking text references, creating new references, or marking word senses.

Main features
  • does not require installation — access through a web browser supporting JavaScript,
  • remote access to the data,
  • data sharing between users,
  • control of work progress,
  • advanced system of access control &mbash; by users and tasks,
  • supports several types of document description:
    • metadata,
    • content cleanup,
    • phrase annotation (a continous sequence of words/tokens),
    • phrase lemmatisation,
    • annotation linking.
  • inter-annotator agreement on the level of phrase annotation,
  • export documents to a ccl format,

Suggested browser: Chrome
Publications

Latest publication

Marcińczuk, M. & Oleksy, M. (2019). Inforex — a Collaborative Systemfor Text Corpora Annotation and Analysis Goes Open. In Proceedings of the International Conference on Recent Advances in Natural Language Processing, RANLP 2019, pages 711―719. Varna, Bulgaria. INCOMA Ltd.

@inproceedings{marcinczuk-oleksy-2019-inforex,
    title     = "{I}nforex {---} a Collaborative Systemfor Text Corpora Annotation and Analysis Goes Open",
    author    = "Marci{\'n}czuk, Micha{\l}  and
                Oleksy, Marcin",
    booktitle = "Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)",
    month     = sep,
    year      = "2019",
    address   = "Varna, Bulgaria",
    publisher = "INCOMA Ltd.",
    url       = "https://www.aclweb.org/anthology/R19-1083",
    doi       = "10.26615/978-954-452-056-4_083",
    pages     = "711--719",
}
                        

Previous publications

Marcińczuk, M., Oleksy, M. & Kocoń, J. (2017). Inforex—a Collaborative System for Text Corpora Annotation and Analysis. In Mitkov, Ruslan, Angelova, Galia (editors), Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 473-482. Varna, Bulgaria. INCOMA Ltd.
@inproceedings{DBLP:conf/ranlp/MarcinczukOK17,
    author    = {Michal Marcinczuk and Marcin Oleksy and Jan Kocon},
    editor    = {Ruslan Mitkov and Galia Angelova},
    title     = {Inforex - a collaborative system for text corpora annotation and analysis},
    booktitle = {Proceedings of the International Conference Recent Advances in Natural
    Language Processing, {RANLP} 2017, Varna, Bulgaria, September 2-8, 2017},
    pages     = {473--482},
    publisher = {{INCOMA} Ltd.},
    year      = {2017},
    url       = {https://doi.org/10.26615/978-954-452-049-6_063},
    doi       = {10.26615/978-954-452-049-6_063},
    timestamp = {Tue, 09 Jan 2018 14:09:59 +0100},
    biburl    = {https://dblp.org/rec/bib/conf/ranlp/MarcinczukOK17},
    bibsource = {dblp computer science bibliography, https://dblp.org}
}
                        
Marcińczuk, M., Kocoń, J. && Broda, B (2012). Inforex — a web-based tool for text corpus management and semantic annotation. In Calzolari, N., Choukri, K., Declerck, T., Do\u{g}an, M. U., Maegaard, B., Mariani, J. et al (editors), Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC-2012), pages 224-230. Istanbul, Turkey : European Language Resources Association (ELRA).
@InProceedings{lMARCICZUK12.446,
    author = {Michał Marcińczuk and Jan Kocoń and Bartosz Broda},
    title = {Inforex -- a web-based tool for text corpus management and semantic annotation},
    booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)},
    year = {2012},
    month = {may},
    date = {23-25},
    address = {Istanbul, Turkey},
    editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
    publisher = {European Language Resources Association (ELRA)},
    isbn = {978-2-9517408-7-7},
    language = {english}
 }
        			
Contribution

Currently involved in the development

Involved in the past

  • Adam Kaczmarek,
  • Jan Kocoń,
  • Marcin Ptak,
  • Mikołaj Szewczyk.