Just a friendly reminder: the final exam will be tomorrow, December 13, from 10:30am to 12:30pm in room LWSN B155. Good luck, everyone!
It is my pleasure to announce that we had an exciting and diverse set of final projects this year. The topics spanned:
- Deep Embedded Join (John Moore and Evan Hanau)
- Supporting Semantic Queries in Spark using Word Embedding (Alina Nesen)
- Analysis of TPC-C Benchmark using RedBlue Consistency Model (Fei Wang and Xingang Wang)
- Fine-grained, Mutable Access on Tachyon (Aman Preet Singh and Nikita Gupta)
- Extending L-Store to Support Temporal and Dynamic Graphs (Chih-Hao Fang and I-Ta Lee)
- Data Skipping Using Synopsis/Bitmaps in Spark (Anshu Maheshwari, Ishan Chawla, and Madhav Kapoor)
- Data Skipping Using Synopsis in Spark (Aakanksha Choudhary and Vedant Mishra)
- Sparkyon: Cost-based View Materialization in Spark through Tachyon (Prashant Ravi and Henry Jebasingh Elilarasu)
- Query Optimization in SparkSQL (Anil Kumar Reddy Pulakanti and Sai Chowdary Samineni)
- Materialized View Selection Using Integer Programming (Tao Jiang and Yu Qiao)
- Materialized View Selection Using Dynamic Programming (Chengzhang Li and Zhongjie Ma)
- Accelerating SparkSQL and Hive Using Materialized Views (Samodya Abeysiriwardane and Karan Prabhu)
- Evaluating Join Re-ordering Strategies in Hive (Emily Kossler and Ayush Parolia)
- Building a Query Optimizer for Map-Reduce (Sowmya Rupa Siddareddy and Sneha Balasubramanian)
- Hadoop vs. Spark: A Comparative Study (Alex Chirayath and Kartik Killawala)
- Analyzing Hive and SparkSQL using TPC-H Benchmark (Guangtong Shen and Haizhou Mo)
November 17, 2016: The deadline for the final project has been extended to Friday, December 9 at the beginning of class (a firm deadline). Late projects will not be accepted! Please submit a hardcopy of your final project report (in class) and email an electronic version (including all your code, test cases, evaluation, report, etc.) to the instructor.
November 17, 2016: Programming Assignment 4 is out. It is due on Wednesday, December 7 on Blackboard (electronic submission). There will be a 10% penalty for each late day. After 5 late days, the homework will not be accepted.
November 11, 2016: Homework 3 is out. It is due on Monday, December 2 at the beginning of class. There will be a 10% penalty for each late day. After 5 late days, the homework will not be accepted.
October 31, 2016: Programming Assignment 3 is out. It is due on Wednesday, November 16 on Blackboard (electronic submission). There will be a 10% penalty for each late day. After 5 late days, the homework will not be accepted.
October 14, 2016: Homework 2 is out. It is due on Monday, October 31 at the beginning of class. There will be a 10% penalty for each late day. After 5 late days, the homework will not be accepted.
October 5, 2016: Programming Assignment 2 is out. It is due on Tuesday, October 25 on Blackboard (electronic submission). There will be a 10% penalty for each late day. After 5 late days, the homework will not be accepted.
October 3, 2016: To further help you progress on the final project, we are providing a detailed handout that describes how to run a subset of TPC-H queries on Hive and SparkSQL. To access our Hadoop/Spark cluster, please read the cluster setup handout.
September 23, 2016: Please submit a one-page final project proposal (as a team) by Friday, October 7 at the beginning of class.
September 9, 2016: Programming Assignment 1 is out. It is due on Friday, September 30 on Blackboard (electronic submission). There will be a 10% penalty for each late day. After 5 late days, the homework will not be accepted.
September 9, 2016: Homework 1 is out. It is due on Friday, September 23 at the beginning of class. There will be a 10% penalty for each late day. After 5 late days, the homework will not be accepted.
September 7, 2016: To help you get started on the final project, we have set up a Hadoop/Spark cluster. For more details, please read the cluster setup handout. The final project deadline is December 1, 2016.
September 4, 2016: We will have a guest lecture by ExxonMobil on Wednesday, September 6 titled "ExxonMobil Databases and Analytics – Unleash your Data Potential".
August 22, 2016: Welcome to CS 541.
Course materials/grades will be made available on your Blackboard account.