Software I have written

 

 

Below are some of the software packages I have written and am giving away under relatively loose licensing terms, and in some cases under no terms whatsoever. If you run across alternative implementations of these tools, please send me links and I will update these pages with this information. Please forgive me if I can't give step-by-step instructions to suit your particular needs.

 

 

PennAspect - A Java implementation of the aspect model, a belief network that has appeared in many communities under various names. In the natural language processing and data mining worlds, "aspect model" and "probabilistic latent semantic indexing" are the prevalent names for this model. A third party has translated our code into C++ and incorporated it into the Lemur toolkit.

 

ROCtools - Includes ROC curves and the CROC curve variant for recommender system evaluation. There are many ROC curve implementations out there. What makes this one different is that it can handle very large datasets. It lacks many of the common add-ons such as error bars and curve smoothing.

 

Logistic PCA - A principal component analysis technique for binary data. Implements the model-fitting strategy introduced in my paper, "A generalized linear model for principal component analysis of binary data." See my publications page for more details. The code is implemented as a Matlab procedure.

 

PCLR - An algorithm that predicts protein localization to the chloroplasts in plants. This is a web-based version of the algorithm described in Nucleic Acids Research, 2001, Vol 29, No. 16 e82. You can download the software that runs on the site.

 

More to come soon, I hope!

 

 

 
