GitHub topics: extraction
aphp/edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.
langage: Python - taille: 8,93 Mo - dernière synchronisation: il y a 3 jours - enregistré: il y a 5 mois - étoiles: 51 - forks: 7

Related Keywords