Posts

Showing posts with the label #VMWare

Cloudera

How-to: Analyze Fantasy Sports using Apache Spark and SQL

As part of the drumbeat for Spark Summit West in San Francisco (June 6-8), learn how analyzing stats from professional sports leagues is an instructive use case for data analytics using Apache Spark with SQL. In the United States, many diehard sports fans morph into amateur statisticians to get an edge over the competition in their fantasy sports leagues. Depending on one's technical chops, this "edge" is usually no more sophisticated than simple spreadsheet analysis, but some particularly intense people go to the extent of creating their own player rankings and projection systems. Online tools can provide similar capabilities, but it's not often transparent where the numbers come from. Although the data involved is not large in volume, the types of data processing, data analytics, and machine-learning techniques used in this area are common to many Apache Hadoop use cases. So, fantasy sports analytics prov...