Vocabulary description


ThesoTM thesaurus


The ThesoTM (Thesaurus of Text Mining) thesaurus is a bilingual (French-English) terminology resource produced by converting an ontology of open source text mining tools into SKOS. It contains more than 1,300 concepts with their definitions and, where available, an associated bibliography.

It reuses classes and properties from existing ontologies:
• BFO (Basic Formal Ontology): an upper-level ontology that models the main categories of reality
• IAO (Information Artifact Ontology): a core ontology that models information artifacts
• SWO (Software Ontology): an ontology that models software
• OMTD-Share (from the OpenMinTed project): a model of natural language processing tasks.

New classes and properties have been introduced where necessary. For each tool, the following are documented: supported languages and input data types; supported operating systems, underlying programming languages, user interfaces and tasks performed; countries of design, licenses and fields of application; and all useful access information (home page, repositories and download locations).

The thesaurus can be downloaded in the following formats: CSV, SKOS-XML and JSON-LD.
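
As an illustration of working with the JSON-LD format, the sketch below collects preferred labels by language from a SKOS document. It is only a sketch under assumptions: the inline sample, the `ex:` identifiers, the `@graph` layout with compact `skos:` keys, and the helper names `as_list` and `pref_labels` are all hypothetical, and the actual ThesoTM export may be shaped differently.

```python
import json

# Hypothetical two-concept sample (not taken from ThesoTM); the real
# export may use a different JSON-LD layout or key names.
SAMPLE = """
{
  "@context": {
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "ex": "http://example.org/"
  },
  "@graph": [
    {"@id": "ex:c1", "@type": "skos:Concept",
     "skos:prefLabel": [{"@language": "en", "@value": "tokenization"},
                        {"@language": "fr", "@value": "tokenisation"}]},
    {"@id": "ex:c2", "@type": "skos:Concept",
     "skos:prefLabel": {"@language": "en", "@value": "lemmatization"}}
  ]
}
"""


def as_list(value):
    """JSON-LD allows a single value or a list; normalise to a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def pref_labels(doc, lang):
    """Map each concept identifier to its preferred label in `lang`."""
    labels = {}
    for node in as_list(doc.get("@graph")):
        if "skos:Concept" not in as_list(node.get("@type")):
            continue  # skip nodes that are not SKOS concepts
        for label in as_list(node.get("skos:prefLabel")):
            if isinstance(label, dict) and label.get("@language") == lang:
                labels[node["@id"]] = label["@value"]
    return labels


doc = json.loads(SAMPLE)
english = pref_labels(doc, "en")
french = pref_labels(doc, "fr")
```

Only the standard `json` module is used here, so the code depends on the assumed compact key names; a JSON-LD processor would be the more robust choice if the export uses a different context.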

The source ontology is available for download here.

An interactive online visualization is available on TM Tools Explorer.


Institute for scientific and technical information (Inist) - CNRS/UAR76



Creation date

Monday, June 8, 2020 00:00:00

Last modification date

Thursday, June 22, 2023 00:00:00

Attribution name

Institute for scientific and technical information (Inist) - CNRS/UAR76

Attribution URL


This resource contains 1,321 terminological entries.

Short title



Number of entries by type


Number of terms by language

Language | Preferred terms | Synonym terms | Hidden terms