We are using a large archive of newspaper stories(GigaWordCorpus) as input to a parallel MPI program, and produce from that a list of top R terms of varying lengths M through N that are especially interesting.
The program is done in C using MPI.
License
GNU General Public License version 2.0 (GPLv2)Follow GigaWordCorpus
Other Useful Business Software
Iris Powered By Generali - Iris puts your customer in control of their identity.
Iris Identity Protection API sends identity monitoring and alerts data into your existing digital environment – an ideal solution for businesses that are looking to offer their customers identity protection services without having to build a new product or app from scratch.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of GigaWordCorpus!