Skip to content
@tetis-nlp

Text-Mining team from TETIS lab

Code developed by the text-mining team of the TETIS lab

TETIS - Text-Mining Team Repository Guide

This repository serves as the central hub for software production by the Text-Mining team at TETIS. It includes tools developed by our team, code for reproducibility, internship projects, and training materials.


🛠 Tools

Type Name Description Repository Link
Local Python Package SNEtoolkit Absolute Spatial Named Entities extraction and disambiguation tool GitHub
Local Streamlit Application THECOB Automatic protocol for the constitution of spatio-temporal and thematic corpora GitHub
 local Streamlit Application GeospaCy GeospaCy is a web application built in Python language used for extracting spatial relation entities (spatRE) from text and Geo-referenced them Github
 python script CompEBS This project aims to compare several EBS tools in terms of spatio-temporal and thematic aspects institutional gitlab
Pip Package GeoNLPlify NLP library for data augmentation focusing on spatial information contained in text GitHub

📄 Article Reproducibility

Journal/Conference Name Description Repository Link
AGILE-2021 H-TFIDF This project aims to extract discriminative terms on spatial and time windows institutional gitlab
LREC-2022 Enriching Epidemiological Thematic Features For Disease Surveillance Corpora Classification  - Github
ESWA Explainable epidemiological thematic features for event based disease surveillance - Github
DS-2024 Geographical Biases in LLMs Evaluation of the quality of LLM geo knowledge GitHub
EGC-2025 Text 2 SQL for LandMatrix Text-to-GraphQL / API REST for querying the Land Matrix database GitHub
Frontiers in AI Automating updates for scoping reviews on the environmental drivers of human and animal diseases: a comparative analysis of AI methods Identify risk factor by comparing GenAI / NER with MLM / NER with MLM with data augmentation Github

🎓 Work of our students

Author Name Description Repository Link Year
Rida Asri CSP-GNN-Analyse-et-l-explication-de-l-artificialisation-des-sols-partir-d-images-HR Dans le cadre du projet ANR Hérelles, ce projet développe une méthode hybride combinant Graph Neural Networks et CSP pour détecter et interpréter l’artificialisation des sols à partir d’images satellitaires haute résolution, fournissant des explications factuelles et contrefactuelles pour appuyer la gestion durable des territoires. github 2025
 Rosalie Corine Nkounghawe Tomeyum SURSY Développer une approche robuste de classification et d’augmentation de données textuelles pour la veille syndromique en santé végétale github 2025
Noureddine Saidi STAY Web App  a tool for temporal, spatial, relational, and semantic analysis of knowledge shared on YouTube about the phenomenon of self-sufficiency in France, developed as part of the STAY project (Technical Knowledge for Self-Sufficiency on YouTube) web app & first work 2025
Aicha Zouhair landmatrix-resourcecontracts-feeder Export mining deals from ResourceContracts to Land Matrix.  github 2025
Fatiha Ait Kbir Text 2 SQL for LandMatrix Text-to-GraphQL / API REST for querying the Land Matrix database GitHub 2024
Nelson Jaimes-Quintero food-insecurity-risk-mining Automatic named entity recognition pipeline to identify possible drivers of food insecurity in French-language news articles. The project supports event extraction (EE) using sentiment analysis and links TIME and LOCATION entities to event mentions. GitHub 2024

📚 Training Materials

Type Name Description Link
Google Colab 2022 - H2020 MOOD PhD school This notebok is used for the "Mining Media Data" session of the MOOD Summer School 2022 GitHub
Google Colab 2023 - pratical-session-nlp-for-one-health-murdoch-mood Practical session on NLP for One Health - Murdoch Mood GitHub
Google Colab 2024 - Geographical Biases in LLMs Evaluation of the quality of LLM geo knowledge GitHub
Google Colab 2024 - ETTM INRAE/DipSO/ASTRA: Vectorisation, clusterisation and classification institutional gitlab

Feel free to explore, contribute, and reach out with any questions! 🚀

Popular repositories Loading

  1. geographical-biases-in-llms geographical-biases-in-llms Public

    Evaluation of the quality of LLM geo knowledge

    Python 5

  2. food-insecurity-risk-mining food-insecurity-risk-mining Public

    Automatic named entity recognition pipeline to identify possible drivers of food insecurity in news articles written in French language. The project aims to support the event extraction (EE) task u…

    Jupyter Notebook 3

  3. pratical-session-nlp-for-one-health-murdoch-mood pratical-session-nlp-for-one-health-murdoch-mood Public

    Pratical session NLP For One Health - Murdoch Mood

    Jupyter Notebook 1

  4. landmatrix-graphql-python landmatrix-graphql-python Public

    Text-to-Graphql / API REST for quering the Land Matrix database

    Jupyter Notebook 1

  5. tetis-challenge_textmine_2024 tetis-challenge_textmine_2024 Public

    UMR TETIS's contribution to the TextMine 2024 Challenge

    Jupyter Notebook

  6. automated_scoping_review automated_scoping_review Public

    Automated Updates for Scoping Reviews of Environmental Drivers of Human and Animal Diseases

    Jupyter Notebook

Repositories

Showing 10 of 12 repositories

Top languages

Loading…

Most used topics

Loading…