An open API service providing repository metadata for many open source software ecosystems.

Package Usage: pypi: smart-open

Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
62 versions
Latest release: about 1 year ago
248 dependent packages
32,367,417 downloads last month

View more package details: https://packages.ecosystem.code.gouv.fr/registries/pypi.org/packages/smart-open

Dependent Repos 29

florilege-team/florilege-website
Florilège est un projet français d'annotation participative de RELs (Ressource Educatives Libres).

Last synced: 8 months ago - Pushed: 12 months ago

pub/ecolab/liriae/liriae-form

Last synced: 8 months ago - Pushed: 12 months ago

aaristov/multichip-snakemake
Process the droplet chips using snakemake pipelines.

Last synced: about 1 year ago - Pushed: 12 months ago

umr-tetis/mood/mood_tweets_ner_content

Last synced: 8 months ago - Pushed: 12 months ago

entrepreneur-interet-general/tf-han

Size: 260 KB - Last synced: 1 day ago - Pushed: over 2 years ago

medialab/keyfayqua
Qui fait quoi ? : NLP tools to detect subject-object-verb triples in French and English

Size: 111 KB - Last synced: 1 day ago - Pushed: over 1 year ago

medialab/spsm-database

Size: 2.46 MB - Last synced: 1 day ago - Pushed: over 1 year ago

agora-gouv/agora-nlp

Size: 102 KB - Last synced: 3 days ago - Pushed: about 1 year ago

diplomatiegouvfr/bna
Baromètre Numérique de l’Agent

Size: 81.4 MB - Last synced: 2 days ago - Pushed: over 2 years ago

pass-culture/data-gcp
Repo pour la team data sur GCP

Size: 18.9 MB - Last synced: 1 day ago - Pushed: 1 day ago

medialab/DeFacto 📦
Tools to enrich De Facto's database

Size: 16.9 MB - Last synced: 1 day ago - Pushed: about 2 years ago

ecolabdata/2021-NLP_AE

Size: 204 MB - Last synced: 1 day ago - Pushed: over 3 years ago

etalab-ia/pseudonymisation_decisions_ce 📦
Temporary repo to split the pseudo livrable

Size: 25.8 MB - Last synced: 4 days ago - Pushed: about 5 years ago

medialab/chatgpt-study
Exploratory research on discourses around ChatGPT and AI.

Size: 79.1 KB - Last synced: 1 day ago - Pushed: almost 2 years ago

21901956/deep-learning-voiture-autonome-projet-m1

Last synced: 12 months ago - Pushed: 12 months ago

remy.decoupes/covid19-tweets-mood-tetis
Extract terms from tweets about covid19 : 1/ Extact and Preprocess tweets from https://github.com/echen102/COVID-19-TweetIDs 2/ Index in elasticsearch and build H-TFIDF 3/ Analyse resultst

Last synced: 12 months ago - Pushed: 12 months ago

etalab-ia/ami-ia-dgs
Dépôt de code pour le projet AMI IA 2 de la DGS.

Size: 38.1 MB - Last synced: 4 days ago - Pushed: over 2 years ago

aphp/eds-pseudo
EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports

Size: 4.78 MB - Last synced: 1 day ago - Pushed: about 2 months ago

almanach/lectaurep/ner
Un projet Gitlab pour rassembler le travail fait sur le NER dans le cadre de Lectaurep.

Last synced: 8 months ago

herelles/herelles-corpora-builder
Automatic protocol for the constitution of spatio-temporal and thematic corpora for the Herelles project.

Last synced: 8 months ago - Pushed: 12 months ago

deep-learning-applied-on-web-and-iot-security/concatenation-deep-learning-detector
The Concatenation Detector helps you to build deep learning models to detect statically web vulnerability - especially Cross-Site Scripting XSS - based on Natural Language Processing (NLP)

Last synced: 8 months ago

cedar/statstical_mentions
Extracting statistical mentions from textual claims to provide trusted content

Last synced: 8 months ago

cedar/excel-search

Last synced: 8 months ago

ecrinum/graph-ta-recherche

Last synced: 8 months ago

cedar/connection-studio
ConnectionStudio integrates highly heterogeneous data into graphs, enriched with extracted entities. Studio users can discover the entities in their data, navigate across connections between datasets, explore and query the data in many ways. The Studio currently supports: CSV, JSON, XML, RDF, text, property graphs, all Office formats, and PDF datasets. For more information, see: https://connectionstudio.inria.fr The scientific publications behind the platform: https://team.inria.fr/cedar/connectionlens/

Last synced: 8 months ago

mgillelevenson/tei_collator
Outil de collation automatisée TEI > TEI produit dans le cadre de ma thèse de doctorat

Last synced: about 1 year ago - Pushed: 12 months ago

mgillelevenson/lemmatisation_xml_tei
Lemmatisation d'un fichier xml-tei avec Pie (latin médiéval), Freeling (castillan/castillan médiéval) ou CLTK (latin classique).

Last synced: about 1 year ago - Pushed: 12 months ago

methal/corpus-methal-all
Travaux du mémoire M2 de Heng Yang sur l'extraction de règles de variation orthographique dans les dialectes alsaciens sur la base d'un corpus de pièces de théâtre

Last synced: 8 months ago