Data

Text+ is a consortium of the National Research Data Infrastructure (NFDI) in Germany and supports all researchers who work with text and language data in a broader sense, including but not limited to linguistics, literary studies, philology, the so-called “minor subjects”, philosophy and language and text-based research in the social sciences and political science.

The institutions involved in Text+ provide extensive resources for research. As a Data Space, it relies on an established infrastructure for the provision and inclusion of text- and language-based research data. Our comprehensive information and reference system in form of the Text+ Registry enables a search across the entire stock of resources. The Federated Content Search (FCS) provides a full-text search within the text and language data. Both services will utilize the potential for improved precision & recall of results by using the controlled vocabulary of the Integrated Authority File (GND). By using the GND authority data, all entries that refer to each other are linked in the Registry. In the Federated Content Search (FCS), potentially ten million GND entities offer unique search entries and thus improve the search results. A comprehensive linking of the supplied data with GND IDs fully exploits this potential and increases the visibility of the content. Additional GND identifiers for entities that are not yet included in the GND can be added and existing ones edited via the GND Agency Text+.


 

Search for resources - The Text+ Registry

The Text+ Registry is a comprehensive information and reference system that provides easy access to resources relevant for research and teaching across the spectrum of language and text-based research disciplines.

The registry lists corpora and text collections, editions and lexical resources, taking into account the needs of the respective subject communities. It aggregates resources from all Text+ centers and brings them together as a central hub.

For editions, the Text+ Registry functions as an open register: On the one hand, relevant catalogues and reference systems are integrated. On the other hand, printed, hybrid and digital editions can be entered manually on the part of the community and thus made findable - regardless of whether they are provided by a Text+ Center.

By using different entities (e.g. persons via GND), all recorded resources are linked and related to each other. By enriching data and making it available via interfaces, comprehensive connectivity to NFDI wide services (e.g. knowledge graphs) is guaranteed.

 


 

The search for text and language data uses the Federated Content Search (FCS), an established search platform that allows accessing distributed resources using a common specification of technical interfaces, data formats, and query languages. The framework was originally developed in the European CLARIN project and is based on standard protocols like OASIS searchRetrieve and the Contextual Query Language (CQL).

Various extensions are under development as part of Text+. These include the extended support of Lexical Resources, entity search, as well as the simplified integration of FCS in new application contexts.

The search platform can be used both as a stand-alone web application (FCS Aggregator) and via the integration into the Text+ website.


 

Search within the website

To search for content on this website, select “Website” in the header of the search mask. The following filters are available: Website Categories, Task Areas and DFG Subject Areas. Search results are updated as you type and are organized according to the website structure. A chain symbol next to a result means that this is a direct link to a specific hit page. A number indicates how many hits have been found in a category. The entry can be expanded by clicking on it to access the individual hit pages.

Example search on the website, search term letters, filter task area Collections