This homework is out of 40 points and should be performed individually (i.e. not with your project partner).
For this homework, you are to complete the project progress requirements as detailed in the project suggestions document. You must first download the netflix data set (available here). You must then compute each of the five following properties of the dataset:
In addition to the five properties given above, you should also investigate five other interesting properties fo the dataset that are relevant to your proejct. Be sure to describe how each of these properties are relevant to your project. At least two of these properties must involve plotting a relation between two variables (i.e. similar to properties 2 and 4 listed above). You should provide a clear writeup describing each of these properties. In addition to turning in this writeup in class, you should also put up a copy online (perhaps in the public_html directory of your cs account). Send a link to jdavis@cs.utexas.edu. Your writeup will then be made available for the rest of the class to use as a reference for the project.
Make sure you turn in any code you used to generate these plots. The coding problems may be implemented in the language of your choice. Print out all code written and attach to your homework solutions. Code should be clearly written and well-commented to receive full credit.