Papers:
NOTE: You must be behind the department firewall to access these papers.
Applications
- Gray, J., et al., Scientific data management in the coming decade. SIGMOD Rec., 2005. 34(4): p. 34-41.
- Szalay, A.S., et al., Designing and mining multi-terabyte astronomy archives: the Sloan Digital Sky Survey, in Proceedings of the 2000 ACM SIGMOD international conference on Management of data. 2000, ACM: Dallas, Texas, United States.
Data Models
- Data Exchange Formats: NetCDF, HDF5
- Array Data Models: Libkin, L., R. Machlin, and L. Wong, A query language for multidimensional arrays: design, implementation, and optimization techniques, in Proceedings of the 1996 ACM SIGMOD international conference on Management of data. 1996, ACM: Montreal, Quebec, Canada.
- Array Data Models: Marathe, A.P. and K. Salem, Query processing techniques for arrays, in Proceedings of the 1999 ACM SIGMOD international conference on Management of data. 1999, ACM: Philadelphia, Pennsylvania, United States.
- Array Data Models: Marathe, A.P. and K. Salem, Query processing techniques for arrays. The VLDB Journal, 2002. 11(1): p. 68-91.
Workflows, Provenance, and Lineage
- Altintas, I., et al., Introduction to scientific workflow management and the Kepler system, in Proceedings of the 2006 ACM/IEEE conference on Supercomputing. 2006, ACM: Tampa, Florida.
- Mandal, N., et al., Integrating existing scientific workflow systems: the Kepler/Pegasus example, in Proceedings of the 2nd workshop on Workflows in support of large-scale science. 2007, ACM: Monterey, California, USA.
- Pennington, D.D., Supporting large-scale science with workflows, in Proceedings of the 2nd workshop on Workflows in support of large-scale science. 2007, ACM: Monterey, California, USA.
- Heinis, T. and G. Alonso, Efficient lineage tracking for scientific workflows, in Proceedings of the 2008 ACM SIGMOD international conference on Management of data. 2008, ACM: Vancouver, Canada.
- Buneman, P., A. Chapman, and J. Cheney, Provenance management in curated databases, in Proceedings of the 2006 ACM SIGMOD international conference on Management of data. 2006, ACM: Chicago, IL, USA.
- Buneman, P. and W.-C. Tan, Provenance in databases, in Proceedings of the 2007 ACM SIGMOD international conference on Management of data. 2007, ACM: Beijing, China.
- Buneman, P., J. Cheney, and S. Vansummeren, On the expressiveness of implicit provenance in query and update languages. ACM Trans. Database Syst., 2008. 33(4): p. 1-47.
- Simmhan, Y.L., B. Plale, and D. Gannon, A survey of data provenance in e-science. SIGMOD Rec., 2005. 34(3): p. 31-36.
- Moreau, L., et al., The provenance of electronic data. Commun. ACM, 2008. 51(4): p. 52-58.
- Chiticariu, L., W.-C. Tan, and G. Vijayvargiya, DBNotes: a post-it system for relational databases based on provenance, in Proceedings of the 2005 ACM SIGMOD international conference on Management of data. 2005, ACM: Baltimore, Maryland.
- Cheney, J., et al., Report on the Principles of Provenance Workshop. SIGMOD Rec., 2008. 37(1): p. 62-65.
Uncertainty Management
- Benjelloun, O., et al., ULDBs: databases with uncertainty and lineage, in Proceedings of the 32nd international conference on Very large data bases. 2006, VLDB Endowment: Seoul, Korea.
- Benjelloun, O., et al., Databases with uncertainty and lineage. The VLDB Journal, 2008. 17(2): p. 243-264.
- Cheng, R., D.V. Kalashnikov, and S. Prabhakar, Evaluating probabilistic queries over imprecise data, in Proceedings of the 2003 ACM SIGMOD international conference on Management of data. 2003, ACM: San Diego, California.
- Soliman, M.A., I.F. Ilyas, and K.C.-C. Chang, Probabilistic top-k and ranking-aggregate queries. ACM Trans. Database Syst., 2008. 33(3): p. 1-54.
- Suciu, D., Probabilistic databases. SIGACT News, 2008. 39(2): p. 111-124.
- Wang, D.Z., et al., BayesStore: managing large, uncertain data repositories with probabilistic graphical models. Proc. VLDB Endow., 2008. 1(1): p. 340-351.
- Jampani, R., et al., MCDB: a Monte Carlo approach to managing uncertain data, in Proceedings of the 2008 ACM SIGMOD international conference on Management of data. 2008, ACM: Vancouver, Canada.
- Singh, S., et al., Orion 2.0: native support for uncertain data, in Proceedings of the 2008 ACM SIGMOD international conference on Management of data. 2008, ACM: Vancouver, Canada.
- Soliman, M.A., I.F. Ilyas, and K.C.-C. Chang, URank: formulation and efficient evaluation of top-k queries in uncertain databases, in Proceedings of the 2007 ACM SIGMOD international conference on Management of data. 2007, ACM: Beijing, China.
- Ré, C. and D. Suciu, Management of data with uncertainties, in Proceedings of the sixteenth ACM Conference on information and knowledge management. 2007, ACM: Lisbon, Portugal.
Integration / Sharing
- Green, T.J., et al., ORCHESTRA: facilitating collaborative data sharing, in Proceedings of the 2007 ACM SIGMOD international conference on Management of data. 2007, ACM: Beijing, China.
- Ives, Z.G., et al., The ORCHESTRA Collaborative Data Sharing System. SIGMOD Rec., 2008. 37(3): p. 26-32.
- Talukdar, P.P., et al., Learning to create data-integrating queries. Proc. VLDB Endow., 2008. 1(1): p. 785-796.
- Taylor, N.E. and Z.G. Ives, Reconciling while tolerating disagreement in collaborative data sharing, in Proceedings of the 2006 ACM SIGMOD international conference on Management of data. 2006, ACM: Chicago, IL, USA.
Cluster/Grid Computing & Storage Systems
- Otoo, E.J., D. Rotem, and S. Seshadri, Optimal chunking of large multidimensional arrays for data warehousing, in Proceedings of the ACM tenth international workshop on Data warehousing and OLAP. 2007, ACM: Lisbon, Portugal.
- Sarawagi, S. and M. Stonebraker, Efficient Organization of Large Multidimensional Arrays, in Proceedings of the Tenth International Conference on Data Engineering. 1994, IEEE Computer Society.
- Singh, G., C. Kesselman, and E. Deelman, A provisioning model and its comparison with best-effort for performance-cost optimization in grids, in Proceedings of the 16th international symposium on High performance distributed computing. 2007, ACM: Monterey, California, USA.
- Venugopal, S., R. Buyya, and K. Ramamohanarao, A taxonomy of Data Grids for distributed data sharing, management, and processing. ACM Comput. Surv., 2006. 38(1): Article 3.
Spatial Database Systems
- Guenther, O. and A. Buchmann, Research issues in spatial databases. SIGMOD Rec., 1990. 19(4): p. 61-68.
- Güting, R.H., An introduction to spatial database systems. The VLDB Journal, 1994. 3(4): p. 357-399.
- Pauly, A. and M. Schneider, Spatial vagueness and imprecision in databases, in Proceedings of the 2008 ACM symposium on Applied computing. 2008, ACM: Fortaleza, Ceará, Brazil.
- Rodríguez, M.A., L. Bertossi, and M. Caniupán, An inconsistency tolerant approach to querying spatial databases, in Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems. 2008, ACM: Irvine, California.
- Vandeurzen, L., M. Gyssens, and D.V. Gucht, An expressive language for linear spatial database queries, in Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems. 1998, ACM: Seattle, Washington, United States.
- Example System: DeWitt, D.J., et al., Clustera: an integrated computation and data management system. Proc. VLDB Endow., 2008. 1(1): p. 28-41.
- Example System: Johnson, R.R., et al., USD - a database management system for scientific research, in Proceedings of the 1992 ACM SIGMOD international conference on Management of data. 1992, ACM: San Diego, California, United States.
- Example System: Chang, C., et al., T2: a customizable parallel database for multi-dimensional data. SIGMOD Rec., 1998. 27(1): p. 58-66.
- Example System: Agrawal, P., et al., Trio: a system for data, uncertainty, and lineage, in Proceedings of the 32nd international conference on Very large data bases. 2006, VLDB Endowment: Seoul, Korea.