Homework assignments for CIS 455 / 555
For most assignments, we will provide a virtual machine image that contains all the
necessary tools. To use this image, you will need VirtualBox (free),
VMware Workstation Player (free
for personal use) or VMware Fusion (not free).
Development will be in Java. We recommend the use of Git, a version control system, for maintaining your project code; if you are not familiar with Git, please have a look at the documentation. As a development environment, you may want to use Eclipse, possibly in combination with the EGit plug-in.
Using the Virtual Machine Image
This very simple assignment will show you how to use the virtual machine image we have prepared for you. You also need to download the VM image.
Web and application server|
Some useful URLs:
Web crawler and XPath engine|
For testing, we have set up a sandbox that you can safely crawl.
|Assignment 3||Storm and MapReduce|
|Final Project||Distributed web crawler and search engine|