Papers:
IoT
- Towards internet of things: Survey and future vision. Said, Omar, and Mehedi Masud. International Journal of Computer Networks 5.1 (2013): 1-17.
- Challenges for database management in the internet of things. Cooper, Joshua, and Anne James. IETE Technical Review 26.5 (2009): 320-329.
Streaming
- Data Stream Management Issues: A Survey. Golab, Lukasz, and M. Tamer Ozsu. Technical Report, Apr. 2003. db. uwaterloo.
- Next-Generation Stream Processing. IEEE Bulletin on Data Engineering.
- S-Store: Streaming Meets Transaction Processing. Meehan, John, et al. PVLDB 8(13), Sept 2015.
Time Series and Sequence Databases
- Gorilla: A Fast, Scalable, In-memory Time Series Database. Tuomas Pelkonen, Scott Franklin, Justin Teller, Paul Cavallaro, Qi Huang, Justin Meza, and Kaushik Veeraraghavan. PVLDB, 8(12):1816–1827, 2015.
- A symbolic representation of time series, with implications for streaming algorithms. Lin, Jessica, et al. Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery. ACM, 2003.
- Querying and mining of time series data: experimental comparison of representations and distance measures. Ding, Hui, et al. Proceedings of the VLDB Endowment 1.2 (2008): 1542-1552.
- Fast subsequence matching in time-series databases. Faloutsos, Christos, Mudumbai Ranganathan, and Yannis Manolopoulos. Vol. 23. No. 2. ACM, 1994.
Signal DBs
- The Case for a Signal-Oriented Data Stream Management System. Girod, Lewis, et al. CIDR. Vol. 7. 2007.
- A comparison of DFT and DWT based similarity search in time-series databases. Wu, Yi-Leh, Divyakant Agrawal, and Amr El Abbadi. Proceedings of the ninth international conference on Information and knowledge management. ACM, 2000.
- Xstream: a signal-oriented data stream management system. Girod, Lewis, et al. 2008 IEEE 24th International Conference on Data Engineering. IEEE, 2008.
Data Cleaning
- Data Cleaning: Problems and Current Approaches. Rahm, Erhard, and Hong Hai Do. IEEE Data Eng. Bull. 23.4 (2000): 3-13.
- Potter's Wheel: An Interactive Data Cleaning System. Raman, Vijayshankar, and Joseph M. Hellerstein. VLDB. Vol. 1. 2001.
- A Primitive Operator for Similarity Joins in Data Cleaning. Chaudhuri, Surajit, Venkatesh Ganti, and Raghav Kaushik. Data Engineering, 2006. ICDE'06. Proceedings of the 22nd International Conference on. IEEE, 2006.
Anomaly Detection
- An overview of anomaly detection techniques: Existing solutions and latest technological trends. Patcha, Animesh, and Jung-Min Park. Computer networks 51.12 (2007): 3448-3470.
- Event regularity and irregularity in a time unit. Lijian Wan and Tingjian Ge. In ICDE, 2016.
- MacroBase: Analytic Monitoring for the Internet of Things. Bailis, Peter, et al. arXiv preprint arXiv:1603.00567 (2016).
- Anomaly detection for discrete sequences: A survey. Chandola, Varun, Arindam Banerjee, and Vipin Kumar. IEEE Transactions on Knowledge and Data Engineering 24.5 (2012): 823-839.
Prediction
- Time series prediction using support vector machines: a survey. Sapankevych, Nicholas I., and Ravi Sankar. IEEE Computational Intelligence Magazine 4.2 (2009): 24-38.
- Processing forecasting queries. Duan, Songyun, and Shivanath Babu. Proceedings of the 33rd international conference on Very large data bases. VLDB Endowment, 2007.
- The Case for Predictive Database Systems: Opportunities and Challenges. Akdere, Mert, et al. CIDR. 2011.
Data Ingestion
- Optimizing Data Warehouse Loading Procedures for Enabling Useful-Time Data Warehousing. Santos, Ricardo Jorge, and Jorge Bernardino. Proceedings of the 2009 International Database Engineering & Applications Symposium. ACM, 2009.
- Optimized Data Loading for a Multi-Terabyte Sky Survey Repository. Cai, Y. Dora, Ruth Aydt, and Robert J. Brunner. Proceedings of the 2005 ACM/IEEE conference on Supercomputing. IEEE Computer Society, 2005.
- Transaction Reordering and Grouping for Continuous Data Loading. Luo, Gang, et al. Business Intelligence for the Real-Time Enterprises. Springer Berlin Heidelberg, 2007. 34-49.
- Improving the Query Performance of High-Dimensional Index Structures by Bulk Load Operations. Berchtold, Stefan, Christian Böhm, and Hans-Peter Kriegel. Advances in Database Technology—EDBT'98. Springer Berlin Heidelberg, 1998. 216-230.
- Data Ingestion for the Connected World. Meehan, John, et al. CIDR, 2017.
Federated Databases and Polystores
- Protocols for Integrity Constraint Checking in Federated Databases. Grefen, Paul, and Jennifer Widom. Distributed and Parallel Databases 5.4 (1997): 327-355.
- Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases. Sheth, Amit P., and James A. Larson. ACM Computing Surveys (CSUR) 22.3 (1990): 183-236.
- Asterix: Scalable Warehouse-Style Web Data Integration. Alsubaiee, Sattam, et al. Proceedings of the Ninth International Workshop on Information Integration on the Web. ACM, 2012.
- The BigDAWG polystore system and architecture. Gadepally, Vijay, et al. High Performance Extreme Computing Conference (HPEC), 2016 IEEE. IEEE, 2016.
- Integrating real-time and batch processing in a polystore. Meehan, John, et al. High Performance Extreme Computing Conference (HPEC), 2016 IEEE. IEEE, 2016.
Publish / Subscribe
- The Many Faces of Publish/Subscribe. Eugster, Patrick Th, et al. ACM Computing Surveys (CSUR) 35.2 (2003): 114-131.
- Design Considerations for High Fan-In Systems: The HiFi Approach. Franklin, Michael J., et al. CIDR. 2005.
- HiFi: A Unified Architecture for High Fan-in Systems. Cooper, Owen, et al. Proceedings of the Thirtieth international conference on Very large data bases-Volume 30. VLDB Endowment, 2004.
Schema Integration
- Schema Mapping as Query Discovery. Miller, Renée J., Laura M. Haas, and Mauricio A. Hernández. VLDB. Vol. 2000. 2000.
- Clio: A semi-automatic tool for schema mapping. Hernández, Mauricio A., Renée J. Miller, and Laura M. Haas. ACM SIGMOD Record 30.2 (2001): 607.
- Data Curation at Scale: The Data Tamer System. Stonebraker, Michael, et al. CIDR. 2013.
- Semantic heterogeneity resolution in federated databases by metadata implantation and stepwise evolution. Aslan, Goksel, and Dennis McLeod. The VLDB Journal—The International Journal on Very Large Data Bases 8.2 (1999): 120-132.
- The Data Civilizer System. Deng, Dong, et al. CIDR, 2017.
Array Databases
- Query Processing Techniques for Arrays. Arunprasad P. Marathe, Kenneth Salem. SIGMOD Conference 1999: 323-334.
- A Language for Manipulating Arrays. Arunprasad P. Marathe, Kenneth Salem. VLDB 1997: 46-55.
- The TileDB Array Data Storage Manager. Stavros Papadopoulos, Kushal Datta, Samuel Madden, Timothy Mattson. VLDB 2016.