An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: embeddings

etalab-ia/mediatech

Collection of public datasets from the French administration, vectorized and ready to use in AI projects.

Language: Python - Size: 486 KB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 3

ina-foss/twembeddings

Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora

Language: Jupyter Notebook - Size: 40.9 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 33 - Forks: 5

VIGINUM-FR/D3lta

A Python implementation of the D3lta algorithm for duplicated textual content detection

Language: Jupyter Notebook - Size: 20.8 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 58 - Forks: 8

France-Travail/embcompare

A simple python tool for embedding comparison

Language: Python - Size: 27.9 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 0