Name	Name	Last commit message	Last commit date
Latest commit History 15 Commits
corpy	corpy
rust	rust
.gitignore	.gitignore
Pipfile	Pipfile
Pipfile.lock	Pipfile.lock
README.rst	README.rst
setup.py	setup.py

Name

Last commit message

Last commit date

CorPy

What is CorPy?

A fancy plural for corpus ;) Also, a collection of handy but not especially mutually integrated tools for dealing with linguistic data. It abstracts away functionality which is often needed in practice in day to day work at the Czech National Corpus, without aspiring to be a fully featured or consistent NLP framework.

Currently available sub-packages are:

morphodita: tokenizing and tagging raw textual data using MorphoDiTa
vertical: parsing corpora in the vertical format devised originally for CWB, used also by (No)SketchEngine

Installation

$ pip3 install git+https://github.com/dlukes/corpy

Requirements

Only recent versions of Python 3 are supported by design.

License

Distributed under the GNU General Public License v3.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CorPy

What is CorPy?

Installation

Requirements

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

dlukes/corpy

Folders and files

Latest commit

History

Repository files navigation

CorPy

What is CorPy?

Installation

Requirements

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages