GitHub topics: embeddings
VIGINUM-FR/D3lta
A Python implementation of the D3lta algorithm for duplicated textual content detection
Language: Jupyter Notebook - Size: 20.8 MB - Last synced at: about 11 hours ago - Pushed at: 6 months ago - Stars: 57 - Forks: 8
etalab-ia/mediatech
Collection of public datasets from the French administration, vectorized and ready to use in AI projects.
Language: Python - Size: 486 KB - Last synced at: 3 days ago - Pushed at: 10 days ago - Stars: 7 - Forks: 2
ina-foss/twembeddings
Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora
Language: Jupyter Notebook - Size: 40.9 MB - Last synced at: about 7 hours ago - Pushed at: 6 months ago - Stars: 32 - Forks: 5
France-Travail/embcompare
A simple Python tool for embedding comparison
Language: Python - Size: 27.9 MB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0