LREC Workshop: Leveraging Derived Text Formats to Unlock Copyrighted Collections for Open Science

Palma de Mallorca
11.  – 16.
Mai 2026

Call for Papers

The workshop Leveraging Derived Text Formats to Unlock Copyrighted Collections for Open Science will be held at the Language Resources and Evaluation Conference (LREC 2026).

Derived Text Formats (DTF), also known as extracted features, offer a promising solution for enabling research on textual data that cannot be shared in its original form due to copyright or privacy restrictions. This workshop brings together researchers, legal experts, and infrastructure providers to explore the creation, standardization, legal framing, and scientific use of derived data in linguistics, digital humanities, and language technology.

We invite contributions from the community that address practical experiences, challenges, and solutions related to:

  • The creation and processing of DTF
  • Legal and ethical considerations in publishing derived data
  • Use cases from digital humanities, linguistic research, corpus linguistics, or NLP
  • Infrastructure and tools supporting DTF flows
  • Standardization efforts (e.g., TEI, SynAF, MAF, ISO standards)

The workshop will be held as a hybrid event. The exact workshop date will be communicated in due time.

Submission Format

Submissions should be 4 to 8 pages in length (excluding references and potential Ethics Statements). Submissions should follow the LREC stylesheet, available on the conference website on the Author’s kit page. Submissions will be reviewed by the workshop organizers and the programme committee.

Important Dates

  • Submission Deadline: 20 February 2026
  • Reviewing period: 21 February 2026 – 10 March 2026
  • Notification of Acceptance: 11 March 2026
  • Camera Ready paper submission Deadline: 30 March 2026
  • Workshop Date: 11, 12 or 16 May, 2026

Submission

Sumbission will be handled via the conference Softconf/START system. The link will be provided here when it becomes available.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones)

Workshop Organisers

  • Florian Barth, Göttingen State and University Library
  • Keli Du, University of Trier
  • José Calvo Tello, Göttingen State and University Library
  • Philippe Genêt, German National Library
  • Piroska Lendvai, Bavarian Academy of Sciences and Humanities
  • Christof Schöch University of Trier
  • Thorsten Trippel, University of Tübingen and Leibniz-Institut für Deutsche Sprache

Programme Committee

tba

Contact

For questions, please contact: dtf-at-lrec2026@googlegroups.com

Veranstaltungsseite / Anmeldung

zuletzt aktualisiert: 01.12.2025