T2 package - develop/libtextcat - A library for efficient, lightweight text classification

libtextcat: A library for efficient, lightweight text classification¹

Libtextcat is a library with functions that implement the classification technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization". It was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy. Considerable effort went into making this implementation fast and efficient. The language guesser processes over 100 documents/second on a simple PC, which makes it practical for many uses.
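
The technique named above ranks the most frequent character n-grams of a text and compares that ranking against per-language profiles using an "out-of-place" distance. The following is a minimal Python sketch of that idea, not libtextcat's C implementation or its API. The function names, the underscore padding, the n-gram lengths of 1 to 5, and the profile size of 300 are all assumptions chosen for illustration.

```python
from collections import Counter


def ngram_profile(text, max_n=5, top_k=300):
    """Return the top_k most frequent character n-grams, most frequent first.

    Hypothetical helper: tokens are lowercased and padded with underscores
    (an assumption) before n-grams of length 1..max_n are counted.
    """
    counts = Counter()
    for token in text.lower().split():
        padded = f"_{token}_"
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count; break ties alphabetically so the result is deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams missing from lang_profile get a fixed maximum penalty."""
    rank = {gram: r for r, gram in enumerate(lang_profile)}
    max_penalty = len(lang_profile)
    return sum(
        abs(r - rank[gram]) if gram in rank else max_penalty
        for r, gram in enumerate(doc_profile)
    )


def guess_language(text, profiles):
    """Return the name of the profile with the smallest out-of-place distance."""
    doc = ngram_profile(text)
    return min(profiles, key=lambda name: out_of_place(doc, profiles[name]))


# Toy training data (illustrative only; real profiles need far more text).
samples = {
    "english": "the quick brown fox jumps over the lazy dog and the cat",
    "dutch": "de snelle bruine vos springt over de luie hond en de kat",
}
profiles = {name: ngram_profile(text) for name, text in samples.items()}
```

Calling `guess_language(some_text, profiles)` returns the key of the closest profile. With training samples this small the guess is unreliable for unseen text; the sketch only shows the shape of the computation.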

URL: http://software.wise-guys.nl/libtextcat/

Author: Frank Scheelen <frank [at] wise-guys [dot] nl>
Maintainer: The T2 Project <t2 [at] t2-project [dot] org>

License: BSD
Status: Stable
Version: 2.2

Remark: Does cross compile (as setup and patched in T2).

Download: http://software.wise-guys.nl/download/libtextcat-2.2.tar.gz

T2 source: libtextcat.cache
T2 source: libtextcat.desc

Build time (on reference hardware): 3% (relative to binutils)²

Installed size (on reference hardware): 0.14 MB, 11 files

Dependencies (build time detected): 00-dirtree bash binutils coreutils diffutils findutils gawk grep gzip linux-header make sed tar

¹) This page was automatically generated from the T2 package source. Corrections, such as dead links, URL changes or typos need to be performed directly on that source.

²) Compatible with Linux From Scratch's "Standard Build Unit" (SBU).
