Real Time vs Micro-batching in Streaming Data Processing: Performance and Guidelines

Name
Mansur Alizada
Abstract
Data is used in every second of our lives. Nowadays, the majority of this data arrives through the Internet. To provide fast and scalable services, the underlying technologies need to be efficient and to scale with those needs. The aim of this thesis is to provide a simple workload for comparing stream processing engines. In this master's thesis, I focus on Apache Flink, Spark Streaming, Apache Kafka, Apache Storm, and Storm Trident for real-time and micro-batch streaming data processing. The thesis aims to show how these technologies compare.
Graduation Thesis language
English
Graduation Thesis type
Master - Computer Science
Supervisor(s)
Pelle Jakovits
Defence year
2021
 