[Savannah-hackers] submission of MG4J - savannah.nongnu.org
From: vigna
Subject: [Savannah-hackers] submission of MG4J - savannah.nongnu.org
Date: Tue, 04 Feb 2003 14:59:04 -0500
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.2) Gecko/20021202
A package was submitted to savannah.nongnu.org
This mail was sent to address@hidden, address@hidden
Sebastiano Vigna <address@hidden> described the package as follows:
License: gpl
Other License:
Package: MG4J
System name: mg4j
Type: non-GNU
Description:
MG4J (Managing Gigabytes for Java) is a collaborative effort aimed at providing
a free Java implementation of inverted-index compression techniques; as a
by-product, it offers several general-purpose optimised classes, including fast
and compact mutable strings, bit-level I/O, and (possibly signed) minimal
perfect hashing.
Generating full-text inverted indices for very large sets of documents
(say, more than tens of millions) is a nontrivial task. MG4J tries to make the
techniques described in the book Managing Gigabytes, by Ian Witten, Alistair
Moffat and Timothy Bell, accessible in a clean, object-oriented environment,
without having to deal with bit-level operations.
You can find the API documentation and more at http://vigna.dsi.unimi.it/MG4J/
Other Software Required:
The COLT distribution (http://tilde-hoschek.home.cern.ch/~hoschek/colt/)
fastUtil (http://vigna.dsi.unimi.it/fastUtil/)
Other Comments: