Tools and Services

Text+ and the participating institutions offer a wide range of services related to language and text data. In addition to research data, services and tools are an essential part of the Text+ offerings for users.

For the provision and maintenance of this service overview, Text+ uses the SSH Open Marketplace, a discovery platform from the Social Sciences and Humanities Cluster within the EOSC.

The list of offerings mentioned below is subject to constant development and expansion. Both a search function and a filter function are implemented for convenient access. Feedback and requests can be directed to the Helpdesk with the subject Infrastructure/Operations.

Activities
Keywords
Thumbnail for service BAS CLARIN Repository
BAS CLARIN Repository

Repository of 50+ annotated speech corpora. Most corpora may be accessed and downloaded by members of academic research institutions, some corpora require licenses, e.g. highly sensitive data, or data for commercial use.

Infrastructure, Textplus
Thumbnail for service BAS Services
BAS Services

, Textplus
Thumbnail for service BAS Web Services
BAS Web Services

Webservice/Download, Textplus
Thumbnail for service CLARIAH-DE Tutorial Finder
CLARIAH-DE Tutorial Finder

The Tutorial Finder allows users to browse freely available and reusable teaching and training materials on procedures, tools, research methods, and topics in the humanities and its related disciplines.

This resource is supported by Text+.

Searching
, DARIAH Resource, Textplus
Thumbnail for service CLARIND-UdS
CLARIND-UdS

CLARIND-UdS is a repository for language resources at Saarland University.

, Textplus
Thumbnail for service CLARIND-UdS FCS Endpoint (Saarbrücken)
CLARIND-UdS FCS Endpoint (Saarbrücken)

The CLARIND-UdS data center is part of the Text+ infrastructure and operated by the Department of Language Science and Technology at Saarland University.

Searching
FCS-Endpoint, Textplus
Thumbnail for service CLARIND-UdS Repository (Saarbrücken)
CLARIND-UdS Repository (Saarbrücken)

The CLARIND-UdS data center is part of the Text+ infrastructure and operated by the Department of Language Science and Technology at Saarland University.

Archiving Publishing
Infrastructure, Textplus
Thumbnail for service CMDI Explorer
CMDI Explorer

CMDI Explorer is a tool that empowers users to easily explore the contents of complex CMDI records and to process selected parts of them with little effort.

Webservice/Download, Textplus
Thumbnail for service Corpus Services
Corpus Services

The (HZSK) Corpus Services were initially developed at the Hamburg Centre for Language Corpora (HZSK) as a quality control and publication framework for EXMARaLDA corpora. Since then, most development work has been done within the INEL project.

Analyzing
, Textplus, curation, conversion
Thumbnail for service correspSearch - Search and Connect Scholarly Editions of Correspondence
correspSearch - Search and Connect Scholarly Editions of Correspondence

With correspSearch you can search within the metadata of diverse edited letters from different scholarly editions and other scholarly publications. One can search according to the letter's sender, adressee, as well as place and date of the letter's creation.

Searching Annotating
Infrastructure, letters, letter-writing, Textplus
Thumbnail for service Critical Pāli Dictionary Online
Critical Pāli Dictionary Online

The Critical Pāli Dictionary Online (CPD) is a digital version of the "Critical Pāli Dictionary", which is a comprehensive dictionary of the Pāli language.

Searching
Infrastructure, Textplus, Indo-Aryan, South Asia, Theravada, dictionary
Thumbnail for service DARIAH-CAMPUS
DARIAH-CAMPUS

Looking for learning resources?

Publishing Communicating Searching
, Education & Training, DARIAH Core Service, ATRIUM catalogue, Textplus
Thumbnail for service DARIAH-DE and OPERAS-GER academic blogging with Hypotheses
DARIAH-DE and OPERAS-GER academic blogging with Hypotheses

Hypotheses is a non-commercial blog portal for the humanities and social sciences. The portal provides a free service that facilitates scientific blogging and ensures greater visibility as well as archiving of content.

Communicating Publishing
, DARIAH Resource, OPERAS Service, Textplus, GKFI e.V.
Thumbnail for service DARIAH-DE Data Federation Architecture (DFA)
DARIAH-DE Data Federation Architecture (DFA)

The DARIAH-DE data federation architecture is the term for services and tools that enable research data and collection descriptions to be found from various sources (such as cultural institutions, libraries, archives, research facilities, and data centers) and to be used for analysis.

Searching Annotating
, Research infrastructures, Textplus
Thumbnail for service DARIAH-DE Data Modeling Environment
DARIAH-DE Data Modeling Environment

Environment for modeling data and their relationships.

The Data Modeling Environment (DME) is a tool for modeling and associating data. By means of the DME, data models and mappings between them are defined and provided in terms of interfaces (REST-API).

Annotating
, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service DARIAH-DE Generic Search
DARIAH-DE Generic Search

Search engine that allows to search in the metadata records of the Collection Registry

The Generic Search creates a comprehensive search facility in DARIAH-DE.

Searching
, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service DARIAH-DE Geo-Browser
DARIAH-DE Geo-Browser

The DARIAH-DE Geo-Browser (or GeoBrowser) allows a comparative visualisation of several requests and facilitates the representation of data and their visualisation in a correlation of geographic spatial relations at corresponding points of time and sequences.

Annotating Analyzing
Infrastructure, DARIAH Resource, ATRIUM catalogue, Textplus, GKFI e.V., tool
Thumbnail for service DARIAH-DE Helpdesk (DARIAH-EU, CLARIAH-DE, Text+)
DARIAH-DE Helpdesk (DARIAH-EU, CLARIAH-DE, Text+)

A good starting point to receive support for DH-related questions, tools and resources provided by CLARIN-D, DARIAH-EU and DARIAH-DE, CLARIAH-DE and Text+ is the helpdesk.

Communicating
, DARIAH Core Service, Textplus, Support, Textplus, GKFI e.V.
Thumbnail for service DARIAH-DE Monitoring of research infrastructures and services using Icinga
DARIAH-DE Monitoring of research infrastructures and services using Icinga

Monitoring is a important factor for the operation of a digital research infrastructure. The data centers focus the hardware and the state of the basic software. Monitoring can be used to correct any faults and failures as quickly as possible.

, DARIAH Resource, Research infrastructures, Textplus, GKFI e.V.
Thumbnail for service DARIAH-DE Publikator
DARIAH-DE Publikator

For whom? Researchers who want to deposit their research data safe, persistent, and referencable in a research data repository.

The DARIAH-DE Publikator offers the possibility to prepare, manage and import research data for the import into the DARIAH-DE Repository.

Publishing Archiving
Webservice/Download, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service DARIAH-DE Repository dhrep
DARIAH-DE Repository dhrep

The DARIAH-DE Repository is a central component of the DARIAH-DE research data federation architecture. The DFA aggregates various services and, thus, ensures a convenient use.

Archiving Publishing
Infrastructure, Research infrastructures, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service Deutsches Textarchiv (DTA)
Deutsches Textarchiv (DTA)

The German Text Archive is a Repository for historical, German-language text corpora at Zentrum Sprache of the Berlin-Brandenburg Academy of Sciences and Humanities.

Publishing Archiving
Infrastructure, Textplus, Academic corpora
Thumbnail for service Digital Collection of Germany National Library
Digital Collection of Germany National Library

Our digital collections consist of e-books, e-papers and e-journals, online dissertations, audio books, digitally recorded music, websites and digitised works.

Searching
Infrastructure, Textplus
Thumbnail for service Digital Humanities Call
Digital Humanities Call

Digital Humanities Call (starting every year in March): We will gladly support you with your research project by providing metadata, digital objects and infrastructures. The provisions in section 60d UrhG apply.

Communicating
, Textplus, Support, Metadata
Thumbnail for service Digitales Wörterbuch der deutschen Sprache (DWDS)
Digitales Wörterbuch der deutschen Sprache (DWDS)

The Digital Dictionary of the German Language (DWDS) is a lexical system at the Berlin-Brandenburg Academy of Sciences and Humanities that provides information about German vocabulary in the past and present.

Infrastructure, Textplus
Thumbnail for service DNBLab
DNBLab

The German National Library offers free access to its bibliographic data and several collections of digital objects. As the central access point for presenting, accessing and reusing digital resources, DNBLab allows users to access our data, object files and full texts.

Analyzing
Infrastructure, Textplus
Thumbnail for service DTA Base Format (DTABf)
DTA Base Format (DTABf)

Format standard for TEI-compliant text annotation of digital full texts of historical prints with an extension for manuscripts.

Annotating
Format standard, Textplus
Thumbnail for service DTA FCS-Endpoint
DTA FCS-Endpoint

Endpoint for the Text+ Federated Content Search to query DTAXL, the historical corpora of the DTA at BBAW.

Searching
FCS-Endpoint, Textplus
Thumbnail for service DWDS FCS-Endpoint
DWDS FCS-Endpoint

Endpoint for the Text+ Federated Content Search to query DWDSXL, a collection of contemporary text corpora at BBAW.

Searching
FCS-Endpoint, Textplus
Thumbnail for service DWDS word history curve
DWDS word history curve

Tool of the Digital Dictionary of the German Language (DWDS) at the Berlin-Brandenburg Academy of Sciences and Humanities (BBAW) for the diachronic analysis of word usages.

Analyzing Searching
Infrastructure, Textplus
Thumbnail for service DWDS-API
DWDS-API

An application programming interface (API) for querying DWDS-corpora at Berlin-Brandenburg Academy of Sciences and Humanities (BBAW). It's the API interface of the Digital Dictionary of the German Language (DWDS).

Searching
, Textplus
ediarum

With ediarum researchers can comfortably transcribe, encode and edit manuscripts in TEI-XML, as well as publish their results in an online or print edition. The solution, developed by TELOTA, is based on three software components: exist-db, Oxygen XML Author, and ConTeXt.

, xml, manuscripts, exist-db, print edition, digital edition, Textplus
Thumbnail for service EmuR
EmuR

The main R package for the EMU Speech Database Management System (EMU-SDMS). R package to integrate signal processing, customized graphical output and interactive phonetic segmentation into the R statistics package.

, Textplus
Thumbnail for service entityXML
entityXML

entityXML is (so far) a concept study in version 0.6.5 (BETA), which aims to model a standardised XML-based data format for the GND Agency Text+.

Annotating
Format standard, Textplus
Thumbnail for service GermaNet API
GermaNet API

GermaNet can be accessed via an API in Python and Java. The APIs allow for advanced search configuration (regular expressions, processing distance, selected word types, word classes, orthographic forms) as well as semantic similarity calculations using six different methods.

, Textplus
Thumbnail for service GermaNet FCS Endpoint
GermaNet FCS Endpoint

The GermaNet FCS endpoint delivers GermaNet-based results to the FCS query language.

FCS-Endpoint, Textplus
GermaNet Rover

Rover is an online application that can be used to access GermaNet data or to calculate the semantic relationship/similarity between two synsets. It offers visualizations and advanced filter options for synset searches.

Webservice/Download, Textplus
Thumbnail for service GND Agency Text+
GND Agency Text+

The GND Agency Text+ is a service that is being set up at the Göttingen State and University Library (SUB Göttingen) as part of the NFDI consortium Text+.

Annotating
, Textplus
Thumbnail for service GND-Explorer
GND-Explorer

Informative, visual, interconnected: The GND Explorer will be the new tool for presenting and searching the Integrated German authority file (GND)! In the future, the GND explorer should provide a convenient and comprehensive access to the GND and its semantic network for all users.

Searching Analyzing
Infrastructure, Textplus, authority-files, linked data (places, persons, bible citations, etc.)
Thumbnail for service HedgeDoc - GWDG Pad
HedgeDoc - GWDG Pad

Online tool for collaborative text editing to work together on the same texts at the same time.

The HedgeDoc-pad is an open-source-based web editor that allows multiple users to work on a single text simultaneously from different locations.

Communicating
, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service IDS Repository
IDS Repository

The IDS Repository deals with the long-term archiving of linguistic and language resources in the field of German studies and serves as an interface to the Virtual Language Observatory, where the data can be explored via a faceted search.

Searching Archiving
, Textplus, long-term archiving, language resources, linguistic resources
Thumbnail for service Indico Event Management
Indico Event Management

The open source software Indico developed by Cern is a web application. Lectures, meetings and conferences can be created using Indico. Three different event types (lecture, meeting and conference) can be created in Indico.

Communicating
, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service Integrated Authority File (GND)
Integrated Authority File (GND)

The Integrated Authority File (GND) is a service facilitating the collaborative use and administration of authority data. These authority data represent and describe entities, i.e.

Annotating
, Textplus, authority-files, linked data (places, persons, bible citations, etc.), Impact TPDL 2024 workshop
Thumbnail for service KorAP FCS-Endpoint
KorAP FCS-Endpoint

Searching
FCS-Endpoint, Textplus
Thumbnail for service Language Archive Cologne (LAC)
Language Archive Cologne (LAC)

The Language Archive Cologne (LAC) supports research, learning and teaching with high quality and dependable digital language resources. The LAC facilitates free and open online access to research data.

Publishing Archiving
, Textplus, CLARIN
Thumbnail for service Language Resource Switchboard
Language Resource Switchboard

A web application that suggests language analysis tools for specific data sets.

Webservice/Download, SSHOC Key Exploitable Result, ATRIUM catalogue, Textplus, Public
MeineDGS - ANNIS

ANNIS Instances of the Public German Sign Language Corpus, created by at Institute of German Sign Language and Communication of the Deaf in the framework of the longterm project DGS-Corpus.

DGS-Corpus is a long-term project by the Academy of Sciences in Hamburg.

Analyzing Annotating
, Textplus
Thumbnail for service Metadata Service
Metadata Service

The metadata provided by the German National Library includes current and retrospective bibliographic data for individual series in the German National Bibliography, authority data from the Integrated Authority file, metadata from the German Union Catalogue of Serials, and data on new releases.

Searching
, Textplus
Thumbnail for service MONAPipe
MONAPipe

MONAPipe stands for "Modes of Narration and Attribution Pipeline". It provides natural-language-processing tools for German, implemented in Python/spaCy.

Publishing Annotating
Webservice/Download, DARIAH Resource, Textplus
Thumbnail for service Notebook Actions - TextGrid Import UI
Notebook Actions - TextGrid Import UI

Archiving
, Textplus, DARIAH Resource
Thumbnail for service Octra Backend
Octra Backend

Web-based management tool for organizing transcription projects: audio files, transcribers, tools and assignments are managed in a graphical user interface.

Webservice/Download, Textplus
Thumbnail for service OCTRA Transcription Editor
OCTRA Transcription Editor

Web-based editor for orthographic transcripts. Octra provides different views of signal and transcript, supports a flexible organization of work, and provides tools to split signal files into segment-sized chunks.

Webservice/Download, Textplus
Thumbnail for service OpenProject
OpenProject

The Project Management Service is a collaboration self service that allows you to manage and track your projects and source code repositories. By using DARIAH-DE OpenProject, users can independently coordinate their projects, keep track of their issues and document their results.

Communicating
, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service OWID FCS-Endpoint
OWID FCS-Endpoint

Searching
FCS-Endpoint, Textplus, lexical analysis, LexicalData
Thumbnail for service Percy Web-Experiment
Percy Web-Experiment

, Textplus
Thumbnail for service Persistent Identifier Service
Persistent Identifier Service

In all aspects of research, the amount of digitally stored data is increasing continuously. Thereby the management will be more and more complex, so that the sustainable reference to data and thus their permanent censibility represent a challenge.

Publishing
, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service perspectivia.net
perspectivia.net

Scientific publications of the Max Weber Foundation (MWS, Max Weber Stiftung) and its partners can be published free of charge on the OA publication platform perspectivia.net in the sense of Diamond Open Access.

Publishing
, Textplus, DARIAH Resource
Thumbnail for service Russian Regions Acoustic Speech Database
Russian Regions Acoustic Speech Database

The Russian Regions Acoustic Speech Database (RuReg) is a collection of speech recordings from various regions of Russia. Rureg aims to capture the diversity of accents, dialects, and speech characteristics across different regions of Russia.

Analyzing
Infrastructure, Textplus, Russian dialects, Russian speech varieties, Speech Database
Thumbnail for service SpeechRecorder
SpeechRecorder

Platform independent tool for performing scripted recordings. Flexible organisation of scripts into sections and groups for sequential or ordered recordings. Each utterance is saved in a separate file.

Webservice/Download, Textplus
Thumbnail for service Text+ Curated Tool Platform for Editions (CSP)
Text+ Curated Tool Platform for Editions (CSP)

Version 1.0 of a curated software platform for scholarly editions developed in Text+/NFDI is now available. The platform improves the visibility of software (and its authors) used in the field of scholarly editions.

Searching
, Textplus
Thumbnail for service Text+ Federated Content Search (FCS)
Text+ Federated Content Search (FCS)

The Federated Content Search (FCS) is a specification and technical infrastructure for querying and aggregating distributed research data.

Searching
FCS-Endpoint, Textplus
Thumbnail for service Text+ GitLab
Text+ GitLab

Web-based source code management with a wide range of functionalities to support development processes. Configuration of continuous integration per project. Support for the merge request workflows. Consultancy and support in setting up your projects. Connection to the GWDG user administration.

Archiving
, Textplus
Thumbnail for service Text+ Web Portal
Text+ Web Portal

Web portal for Text+ based on HugoCMS and a CI/CD deployment pipeline. This resource is supported by Text+. In case of questions you may get in touch with the Text+ helpdesk at textplus-support@gwdg.de.

, Textplus
Thumbnail for service Text+ Zenodo Community
Text+ Zenodo Community

The Text+ Zenodo Community gathers a growing collection of affiliated research outputs, guidelines, project deliverables and other documents affiliated with Text+, the NFDI consortium for text- and langeuage-related research data.

Publishing
, Textplus
Thumbnail for service TextGrid Repository & Laboratory
TextGrid Repository & Laboratory

TextGrid is a virtual research environment for text-based humanities scholarship. It offers a variety of tools and services for collaboratively creating, analyzing, editing, and publishing texts.

Archiving Searching Annotating Communicating Publishing
, Research infrastructures, cloud storage, research, VRE, grid, DARIAH Resource, Textplus, GKFI e.V.
Thumbnail for service tg-model - TextGrid Import Modeller
tg-model - TextGrid Import Modeller

Whats the aim? This project focuses on attemps for a simple import of text corpora (encoded in XML/TEI) to TextGrid Repository by modeling the required metadata file structure.

Archiving
, Textplus, DARIAH Resource
Thumbnail for service tgadmin - TextGrid repository administration cli tool, based on tgclients
tgadmin - TextGrid repository administration cli tool, based on tgclients

What is the aim? A command line tool for managing your projects in the TextGrid repository without TextGridLab.

The actual data import is finally carried out by the Python tools tgadmin and tgclients, which in turn communicate with the TextGrid backend via the various TextGridRep APIs.

Archiving
, Textplus, DARIAH Resource
Thumbnail for service tgclients - TextGrid Python clients
tgclients - TextGrid Python clients

What is the aim?

The actual data import is finally carried out by the Python tools tgadmin and tgclients, which in turn communicate with the TextGrid backend via the various TextGridRep APIs.

Archiving
, Textplus, DARIAH Resource
TSAKorpus Hosting

Hosting of instances of multimodal/spoken corpora ANNIS Instances Language Corpora (i. e. multimodal Corpus of German Sign Language).

, Textplus
Thumbnail for service Tübinger Treebank Collection
Tübinger Treebank Collection

, Textplus
Thumbnail for service TüNDRA
TüNDRA

TüNDRA is a web application for searching in 1004 treebanks for 173 languages such as treebanks for German and the full set of Universal Dependency treebanks. TüNDRA uses a lightweight query language based on TIGERSearch application.

Searching
, Textplus, CLARIN
TüNDRA Treebanks FCS Endpoint

FCS Endpoint which allows users to query TüNDRA treebanks from the Text+ Federated Content Search Aggregator. It does this by translating the FCS queries into the TüNDRA query language and returning the results.

Searching
FCS-Endpoint, Textplus
Thumbnail for service WebLicht
WebLicht

WebLicht is an execution environment for automatic annotation of text corpora. Linguistic tools such as tokenizers, part of speech taggers, and parsers are encapsulated as web services, which can be combined by the user into custom processing chains.

Analyzing Annotating
, tokenizing, corpora, Textplus
Thumbnail for service Weblicht as a Service
Weblicht as a Service

WebLicht as a Service (WaaS) is a REST service that executes WebLicht chains. This allows you to run WebLicht chains from your UNIX shell, scripts, or programs.

There are several advantages of using WaaS rather than making direct requests to WebLicht web services.

Webservice/Download, Textplus
Thumbnail for service WebLicht Batch
WebLicht Batch

WebLicht-Batch is a web-based interface to WebLicht’s chainer back-end. WebLicht-Batch helps users to automatically feed large input data, or input data of multiple files into WebLicht.

, Textplus
Thumbnail for service WebLicht Const Parsing EN
WebLicht Const Parsing EN

WebLicht Easy Chain for Constituency Parsing (English). The pipeline makes use of WebLicht's TCF converter, the Stanford tokenizer, and the statistical BLLIP/Charniak parser.

Analyzing
, Textplus, CLARIN
Thumbnail for service WikiSpeech
WikiSpeech

Web-based managment tool for scripted speech recordings via the Internet based on SpeechRecorder scripts.

What's it all about?

WikiSpeech is a content management system for the web-based creation of speech databases for the development of spoken language technology and basic research.

Webservice/Download, Textplus

Guidelines for the description of Text+ services on the SSH Open Marketplace

This short guideline documents how services affiliated with Text+ may be described in the SSH Open Marketplace.

Service definition

The collection of resources on this website includes, in addition to genuine Text+ developments, further offerings from partners contributing to Text+. How services become part of the Text+ portfolio is addressed in the Text+ Services Policy, Version 0.9, which is subject to ongoing internal project discussions and further development.

Changelog

  • upcoming: differentiation between genuine Text+ offerings and other offerings relevant to the community; addition of funding references in the service descriptions
  • October 2024: Expansion of the collection to 79 resources. Enhancement and curation of existing contributions. Update of the Text+ Services Policy to v0.9.
  • July 2024: linking of the guide & description guidelines for Text+ services in the SSH Open Marketplace; addition of filtering by categories and keywords
  • June 2024: expansion to 35 entries as well as partial revision of individual descriptions and the introductory text on this page
  • May 2024: initial version of a service list with 29 entries