Publication Date
Spring 2019
Degree Type
Master's Project
Degree Name
Master of Science (MS)
Department
Computer Science
First Advisor
Robert Chun
Second Advisor
Jon Pearce
Third Advisor
Manoj Thakur
Keywords
Lambda Architecture (LA), Batch Processing, Stream Processing, Real-time Data Processing
Abstract
Data has evolved immensely in recent years, in type, volume and velocity. There are several frameworks to handle the big data applications. The project focuses on the Lambda Architecture proposed by Marz and its application to obtain real-time data processing. The architecture is a solution that unites the benefits of the batch and stream processing techniques. Data can be historically processed with high precision and involved algorithms without loss of short-term information, alerts and insights. Lambda Architecture has an ability to serve a wide range of use cases and workloads that withstands hardware and human mistakes. The layered architecture enhances loose coupling and flexibility in the system. This a huge benefit that allows understanding the trade-offs and application of various tools and technologies across the layers. There has been an advancement in the approach of building the LA due to improvements in the underlying tools. The project demonstrates a simplified architecture for the LA that is maintainable.
Recommended Citation
Malusare, Omkar Ashok, "Real-Time Data Processing With Lambda Architecture" (2019). Master's Projects. 681.
DOI: https://doi.org/10.31979/etd.2s5u-hgps
https://scholarworks.sjsu.edu/etd_projects/681