Author: Dawei Li
Topic: Category-based N-gram Counting and Analysis Using Map-Reduce Framework
Class Size: 14 students
We use one GENI account (through Flukes) to create slices for all students, and distribute the SSH private key as well as the assigned IP address to each of them.
|Controllers Used||OSF (7 slices/7 nodes each) WVN (8 slices/3 nodes each)|
|Duration||9 days (11 slices) 14 days (4 slices)|
The Flukes tool is really convenient. I just spend a few hours to figure out how to use it and how to create a Hadoop cluster. Only one thing is that I have to poll the status of the slice myself to know if it is ready or not. As far as I know, the Flack GUI of ProtoGENI can poll it automatically and show users the changing status until it is ready.
I have heard no complaint from students about connection problem, meaning that the testbed resources are relatively stable whether accessing on campus or not. Some students cannot log into the testbed just because they are not familiar with SSH. However, one grader said that he couldn’t log into the testbed on May 3rd (around midnight) using one OSF slice, but he can log in again the next morning.