An open API service providing repository metadata for many open source software ecosystems.

GitHub / aphp / edspdf

EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule-based or machine-learning-based approaches to classify text blocks as body or metadata.

JSON API: https://ecosystem.code.gouv.fr/api/v1/hosts/GitHub/repositories/aphp%2Fedspdf
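
The JSON endpoint above can be queried with the Python standard library alone. The following is a minimal sketch, assuming the endpoint returns a single JSON object; the helper names `repository_url` and `fetch_repository` are hypothetical and not part of any published client, and no response field names are assumed.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base of the API, taken from the JSON API URL listed above.
BASE = "https://ecosystem.code.gouv.fr/api/v1"


def repository_url(host: str, full_name: str) -> str:
    """Build the repository endpoint URL (hypothetical helper).

    The slash in "owner/name" is percent-encoded as %2F,
    matching the URL shown on this page.
    """
    return (
        f"{BASE}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


def fetch_repository(host: str, full_name: str, timeout: float = 10.0) -> dict:
    """Fetch and decode the repository metadata (requires network access).

    Assumes the response body is a JSON object.
    """
    with urlopen(repository_url(host, full_name), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_repository("GitHub", "aphp/edspdf")` and printing `sorted(data)` on the result is a quick way to see which fields the service actually returns before relying on any of them.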

Stars: 57
Forks: 7
Open issues: 0

License: bsd-3-clause
Language: Python
Size: 8.93 MB
Dependencies parsed at: 0

Created at: over 3 years ago
Updated at: 19 days ago
Pushed at: 9 months ago
Last synced at: about 18 hours ago

Commit Stats

Commits: 293
Authors: 10
Mean commits per author: 29.3
Development Distribution Score: 0.621
More commit stats: https://commits.ecosystem.code.gouv.fr/hosts/GitHub/repositories/aphp/edspdf
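
The two derived figures above can be reproduced from raw counts. The sketch below assumes the Development Distribution Score follows its usual definition, one minus the share of commits made by the most active author; this page does not state the formula, so treat that as an assumption. The per-author counts in the usage note are hypothetical, since only the totals are listed here.

```python
def mean_commits_per_author(total_commits: int, authors: int) -> float:
    """Average number of commits per author."""
    return total_commits / authors


def development_distribution_score(commits_by_author: list[int]) -> float:
    """Assumed definition: 1 - (commits by top author / total commits).

    A value near 0 means one author made almost every commit;
    values closer to 1 mean the work is spread across authors.
    """
    total = sum(commits_by_author)
    if total == 0:
        raise ValueError("no commits to score")
    return 1 - max(commits_by_author) / total
```

With the totals listed above, `mean_commits_per_author(293, 10)` gives 29.3. For a hypothetical project whose three authors made 6, 3, and 1 commits, `development_distribution_score([6, 3, 1])` gives 0.4.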

Topics: extraction, machine-learning, pdf

No dependencies found