We are using a large archive of newspaper stories(GigaWordCorpus) as input to a parallel MPI program, and produce from that a list of top R terms of varying lengths M through N that are especially interesting.
The program is done in C using MPI.

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow GigaWordCorpus

GigaWordCorpus Web Site

Other Useful Business Software
Iris Powered By Generali - Iris puts your customer in control of their identity. Icon
Iris Powered By Generali - Iris puts your customer in control of their identity.

Increase customer and employee retention by offering Onwatch identity protection today.

Iris Identity Protection API sends identity monitoring and alerts data into your existing digital environment – an ideal solution for businesses that are looking to offer their customers identity protection services without having to build a new product or app from scratch.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of GigaWordCorpus!

Additional Project Details

Operating Systems

BSD, Linux

Intended Audience

Science/Research

Programming Language

C

Related Categories

C Text Processing Software, C Information Analysis Software

Registered

2008-10-23