Below are some of the software packages I have written and am giving away under relatively loose licensing terms, and in some cases under no terms whatsoever. If you run across alternative implementations of these tools, please send me links and I will update these pages with this information. Please forgive me if I canÕt give step-by-step instructions to suit your particular needs.
PennAspectÐ A java implementation of the Aspect model, a belief network that has prevailed in many communities under various names. In the natural language processing and data mining worlds the namesÒaspect modelÓ or Òprobabilistic latent semantic indexingÓ are prevalent terms for this model. A third party has translated our code into C++ and incorporated it into the Lemur toolkit.
ROCtoolsÐ includes ROC curves and the CROC curve variant for recommender systemevaluation. There are many ROC curve implementations out there. What makes this one different is that it can handle very large datasets. It lacks many of the common add-ons such as error-bars and curve smoothing.
Logistic
PCA Ð A principal component analysis technique for binary
data.
PCLR Ð An algorithm that
predicts protein localization to the chloroplasts in
plants.
More to come soon, I hope!