PhD. Seminar: Maximizing Bigdata Retrieval: Block as a Value for NoSQL over SQL by Abdulrauf Aremu Gidado

Friday, December 1, 2023 - 10:00

The School of Computer Science at the University of Windsor is pleased to present …

 

Title: Maximizing Bigdata Retrieval: Block as a Value for NoSQL over SQL

PhD. Seminar by: Abdulrauf Aremu Gidado

 

Date: Friday, December 1, 2023

Time: 10:00 AM – 11:00 AM

Location: Essex Hall 122

 

Abstract:

To date, the majority of large corporations such as Amazon and Facebook still run their core solutions (e.g., payments) on relational databases and use non-relational Bigdata (i.e., NoSQL) database management systems only for non-core systems (e.g., shopping cart) that favor availability and scalability through partitioning while trading off consistency. NoSQL systems are built around the CAP (Consistency, Availability and Partition tolerance) theorem, under which a distributed system can guarantee only two of these three properties while trading off the third. The need for system availability and scalability drives the use of NoSQL models, while the lack of the consistency and robust query engines obtainable in relational databases impedes their usage. To mitigate these drawbacks, researchers and companies like Amazon, Google and Facebook developed 'SQL over NoSQL' systems such as Amazon’s Dynamo, Google's Spanner, Facebook’s Memcache, Zidian2019, Apache Hive and SparkSQL. These systems create a query engine layer over NoSQL systems but suffer from data redundancy due to the lack of normalized database relations, and they lack the consistency obtainable in relational databases. Also, their query engines are not relationally complete, because they cannot process all relational algebra-based queries that a relational database can. In this seminar, we present a ‘NoSQL over SQL’ system, an inverse of existing approaches. This approach is motivated by the need for existing systems to fully deploy NoSQL data store functionalities without the limitation of building an extra SQL layer for querying. To allow appropriate storage and retrieval of data on document-based NoSQL databases without data redundancy and inconsistency while encouraging both horizontal and vertical partitioning, we propose the NoSQL over SQL Block as a Value (BaaV) data storage strategy...

-based stores with our BaaV model and by using a leader node strategy in the NoSQL stores cluster for our read/write operations while retaining an in-sync replica node, similar to Apache Kafka's data replication strategy. Using existing benchmark systems of ‘SQL over NoSQL’, relational databases and real-life datasets for our experiments, we demonstrate that our NoSQL over SQL system outperforms existing relational databases and SQL over NoSQL systems, and that it is novel in ensuring data consistency, scalability and query execution while improving data storage, retrieval and performance in large NoSQL database systems without data loss.

Keywords: Big Data, NoSQL Database, SQL over NoSQL, NoSQL over SQL, Relational Database, BaaV

 

PhD Committee:

Internal Reader: Dr. Alioune Ngom

Internal Reader: Dr. Curtis Bright

External Reader: Dr. Christian Trudeau (Department of Economics)

Advisor(s): Dr. Christie Ezeife