An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: embeddings

VIGINUM-FR/D3lta

A Python implementation of the D3lta algorithm for duplicated textual content detection

Language: Jupyter Notebook - Size: 20.8 MB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 53 - Forks: 8

etalab-ia/mediatech

Collection of public datasets from the French administration, vectorized and ready to use in AI projects.

Language: Python - Size: 340 KB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 4 - Forks: 1

ina-foss/twembeddings

Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora

Language: Jupyter Notebook - Size: 40.9 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 31 - Forks: 5

France-Travail/embcompare

A simple python tool for embedding comparison

Language: Python - Size: 27.9 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0