Overview of SciDB

Large scale array storage, processing and analysis

J. Rogers*, R. Simakov, E. Soroush, P. Velikhov, M. Balazinska, D. DeWitt, B. Heath, D. Maier, S. Madden, J. Patel, M. Stonebraker, S. Zdonik, A. Smirnov, K. Knizhnik, Paul G. Brown

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

180 Citations (Scopus)

Abstract

SciDB [4, 3] is a new open-source data management system intended primarily for use in application domains that involve very large (petabyte) scale array data; for example, scientific applications such as astronomy, remote sensing and climate modeling, bio-science information management, risk management systems in financial applications, and the analysis of web log data. In this talk we will describe our set of motivating examples and use them to explain the features of SciDB. We then briefly give an overview of the project 'in flight', explaining our novel storage manager, array data model, query language, and extensibility frameworks.

Original language: English (US)
Title of host publication: Proceedings of the 2010 International Conference on Management of Data, SIGMOD '10
Pages: 963-968
Number of pages: 6
DOIs: https://doi.org/10.1145/1807167.1807271
State: Published - Jul 23 2010
Event: 2010 International Conference on Management of Data, SIGMOD '10 - Indianapolis, IN, United States
Duration: Jun 6 2010 to Jun 11 2010

Publication series

Name: Proceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print): 0730-8078

Other

Other: 2010 International Conference on Management of Data, SIGMOD '10
Country: United States
City: Indianapolis, IN
Period: 6/6/10 to 6/11/10

Fingerprint

Information management
Processing
Astronomy
Query languages
Risk management
Data structures
Remote sensing
Managers

Keywords

  • acm sigmod industrial proceedings scientific data management

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Rogers, J., Simakov, R., Soroush, E., Velikhov, P., Balazinska, M., DeWitt, D., ... Brown, P. G. (2010). Overview of SciDB: Large scale array storage, processing and analysis. In Proceedings of the 2010 International Conference on Management of Data, SIGMOD '10 (pp. 963-968). (Proceedings of the ACM SIGMOD International Conference on Management of Data). https://doi.org/10.1145/1807167.1807271
@inproceedings{35bf1b416e494201955bd9408ff4cd68,
title = "Overview of SciDB: Large scale array storage, processing and analysis",
abstract = "SciDB [4, 3] is a new open-source data management system intended primarily for use in application domains that involve very large (petabyte) scale array data; for example, scientific applications such as astronomy, remote sensing and climate modeling, bio-science information management, risk management systems in financial applications, and the analysis of web log data. In this talk we will describe our set of motivating examples and use them to explain the features of SciDB. We then briefly give an overview of the project 'in flight', explaining our novel storage manager, array data model, query language, and extensibility frameworks.",
keywords = "acm sigmod industrial proceedings scientific data management",
author = "J. Rogers and R. Simakov and E. Soroush and P. Velikhov and M. Balazinska and D. DeWitt and B. Heath and D. Maier and S. Madden and J. Patel and M. Stonebraker and S. Zdonik and A. Smirnov and K. Knizhnik and Brown, {Paul G.}",
year = "2010",
month = "7",
day = "23",
doi = "10.1145/1807167.1807271",
language = "English (US)",
isbn = "9781450300322",
series = "Proceedings of the ACM SIGMOD International Conference on Management of Data",
pages = "963--968",
booktitle = "Proceedings of the 2010 International Conference on Management of Data, SIGMOD '10",

}


