Home
Education
Research
Teaching
Students
                                 

My research is in the broad area of Database Management Systems and Information Management. In particular, I work in the areas of query processing and optimization, indexing techniques, scientific data management, and Big data and metadata analytics. My recent work has explored extending Hadoop infrastructure to support complex operations such as joins and aggregations efficiently on large-scale datasets. Currently, I am exploring possible extensions to both database management systems and Hadoop framework to support scientific applications and health-care systems.


Recent Funding
NSF Award

PI- National Science Foundation (NSF)-Award: CRI-1305258: 8/1/13 - 7/31/14, “Compute Infrastructure for Large-Scale Data Analytics”, $189,952.

Recent Research Projects
* InsightNotes: Large-Scale Annotation Management.

* Redoop: Recurring Queries in MapReduce Infrastructure.

* HandsOn DB: Managing Human-Involved Dependencies in RDBMS.

* CoHadoop & E3: Hadoop-Based Query Optimizations.


Selected Publications: DBLP Google Scholar
1-


2-


3-


4-


5-

6-


7-


8-


9-


10-


11-


12-


13-


14-


15-


16-


17-



18-


19-


20-
Chuan Lei, Zhongfang Zhuang, Elke Rundensteiner, Mohamed Y. Eltabakh, "Redoop Infrastructure for Recurring Big Data Queries", Demo in VLDB 2014, To Appear. [pdf]

Dongqing Xiao, Mohamed Y. Eltabakh, "InsightNotes: Summary-Based Annotation Management in Relational Databases", SIGMOD 2014, Utah, USA. [pdf]

Yang Zheng, Annies Ductan, Devin Thomas,  Mohamed Y. Eltabakh, "Complex Patten Processing in Spatio-Temporal Databases", DATA 2014, Vienna, Austria, To Appear.  [pdf]

Anh Pham, Mohamed Y. Eltabakh, "FunctionGuard: A Query Engine For Expensive Scientific Functions In Relational Databases", DATA 2014, Vienna, Austria, To Appear.  [pdf]

Chuan Lei, Elke Rundensteiner, Mohamed Y. Eltabakh, “Redoop: Supporting Recurring Queries in Hadoop’’, EDBT 2014, Athens, Greece. [pdf]

Karim Ibrahim, Nathaniel Selvo, Mohamed El-Rifai, Mohamed Y. Eltabakh, “FusionDB: Conflict Management System for Small-Science Databases’’, Demo in CIKM, 2013, pp. 2469- 2472, CA, USA. [pdf]
   
Dongqing Xiao, Mohamed Y. Eltabakh, “STEPQ: Spatio-Temporal Engine for Complex Pattern Queries”, International Symposium on Spatial and Temporal Databases (SSTD) 2013, pp. 386-390, Munich Germany. [pdf]

Mohamed Eltabakh, Fatma Ozcan, Yannis Sismanis, Peter Haas, Hamid Pirahesh, and Jan Vondrak. "Eagle-Eyed Elephant: Split-Oriented Indexing in Hadoop",  EDBT 2013, Genoa, Italy. [pdf]

Mohamed Eltabakh, Walid Aref, Ahmed Elmagarmid, Mourad Ouzzani, “HandsOn DB: Managing Data Dependencies involving Human Actions”,  TKDE, 2013. [pdf]

Mohamed Y. Eltabakh, Jalaja Padma, Yasin N. Silva, Walid G. Aref, Elisa Bertino, “Query Processing with K-Anonymity”, International Journal of Data Engineering (IJDE), (3, 2), pp. 48-65, 2012. [pdf]

Mohamed Y. Eltabakh, Yuanyuan Tian, Fatma Ozcan, Rainer Gemulla, Aljoscha Krettek, John McPherson, "CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop", PVLDB 4(9): 575-585, 2011. [pdf]

Kevin Beyer, Vuk Ercegovac, Rainer Gemulla, Andret Balmin, Mohamed Y. Eltabakh, Carl-Christian Kanne, Fatma Ozcan, Eugebe Shekita, "Jaql: A Scripting Language for Large Scale Semi-Structured Data Analysis", PVLDB 4(9), 2011. [pdf]

Mohamed Y. Eltabakh, Walid G. Aref, Ahmed K. Elmagarmid, Yasin Silva, Mourad Ouzzani, “Supporting Real-world Activities in Database Management Systems”, Short paper in the International Conference on Data Engineering (ICDE) 2010, Los Angeles, CA, pp 808-811. [pdf]

Mohamed Y. Eltabakh, Walid G. Aref, Ahmed K. Elmagarmid: "A database server for next-generation scientific data management". ICDE PhD Workshops 2010: 313-316. [pdf]

Mohamed Y. Eltabakh, Walid G. Aref, Ahmed K. Elmagarmid, Mourad Ouzzani, Yasin  Silva “Supporting Annotations on Relations”, In Proceedings of the International Conference on Extending Database Technology (EDBT) 2009, Saint-Petersburg, Russia, pp.  379-390. [pdf]

Mohamed Y. Eltabakh, Wing-Kai Hon, Rahul Shah, Walid G. Aref,  Jeffrey S. Vitter, “The SBC-Tree: An Index for Run-Length Compressed Sequences”, In Proceedings of the International Conference on Extending Database Technology (EDBT) 2008, Nantes, France, pp. 523-534. [pdf]

Mohamed Y. Eltabakh, Mourad Ouzzani, Walid G. Aref, Ahmed K. Elmagarmid, Yasin Laura-Silva, Muhammad Arshad, David Salt, Ivan Baxter “Managing Biological Data using bdbms”, Demo in the International Conference on Data Engineering (ICDE) 2008, Cancun, Mexico, pp. 1600-1603. [pdf]

Mohamed Y. Eltabakh, Mourad Ouzzani, Walid G. Aref, “Duplicate Elimination in Space-partitioning Tree Indexes”, In Proceedings of  the  International  Conference  on  Scientific  and  Statistical  Database  Management (SSDBM) 2007, Banff, Canada, pp. 18-27. [pdf]

Rafae Bhatti, Arjumand Samuel, Mohamed Y. Eltabakh, Haseeb Amjad, Arif Ghafoor, “Engineering a Policy-Based System for Federated Healthcare Databases”, IEEE Transactions on Knowledge and Data Engineering Journal (TKDE) 2007 19(9), pp. 1288-1304. [pdf]

Mohamed Y. Eltabakh, Mourad Ouzzani, Walid G. Aref, “bdbms: A Database Management System for Biological Data”, In Proceedings of the Conference on Innovative Data Systems Research (CIDR) 2007, Asilomar, CA, pp.196-206. [pdf]