7,578 projects with the selected classifier
Machine learning software for organizing data into categories
utoken is a universal tokenizer (multilingual word segmenter) that divides text into words, punctuation and special tokens such as numbers, URLs, XML tags, email-addresses and hashtags. It comes with a companion detokenizer.
Toolkit for reading and writing EMDB-SFF files
A python package to automatically generate API documents for Python modules.
The hyphenation library of LibreOffice and FireFox wrapped for Python
TextQuoter is a versatile Python script designed to simplify the processing of quotation marks in a given string.
Yet Another Option Parser (full fledged)
Trivial split for strings with quotes and escaped characters
Polish stemmer.
Quick setup utility for sphinx
The atomic CSS compiler
Toolkit that simplifies corpus processing
ipapy is a Python module to work with IPA strings
Web-based software suite for Computational Linguistic Analysis based on construction grammars and ontologies (HPSG, SBCG, xMRS, Wordnet, texttaglib)
LaTeX Jinja2 i18n utilities.
Collects and extracts URLs from given text. Forked from https://pypi.python.org/pypi/urlextract.
Ontonotes-5-parsing: parser of Ontonotes 5.0 to transform this corpus to a simple JSON format.
教典用字
Python documentation generator
Supported by