Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Enabling big geoscience data analytics with a cloud-based, MapReduce-enabled and service-oriented workflow framework.

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Additional Information
    • Source:
      Publisher: Public Library of Science Country of Publication: United States NLM ID: 101285081 Publication Model: eCollection Cited Medium: Internet ISSN: 1932-6203 (Electronic) Linking ISSN: 19326203 NLM ISO Abbreviation: PLoS One Subsets: MEDLINE
    • Publication Information:
      Original Publication: San Francisco, CA : Public Library of Science
    • Subject Terms:
    • Abstract:
      Geoscience observations and model simulations are generating vast amounts of multi-dimensional data. Effectively analyzing these data are essential for geoscience studies. However, the tasks are challenging for geoscientists because processing the massive amount of data is both computing and data intensive in that data analytics requires complex procedures and multiple tools. To tackle these challenges, a scientific workflow framework is proposed for big geoscience data analytics. In this framework techniques are proposed by leveraging cloud computing, MapReduce, and Service Oriented Architecture (SOA). Specifically, HBase is adopted for storing and managing big geoscience data across distributed computers. MapReduce-based algorithm framework is developed to support parallel processing of geoscience data. And service-oriented workflow architecture is built for supporting on-demand complex data analytics in the cloud environment. A proof-of-concept prototype tests the performance of the framework. Results show that this innovative framework significantly improves the efficiency of big geoscience data analytics by reducing the data processing time as well as simplifying data analytical procedures for geoscientists.
    • References:
      Proc Natl Acad Sci U S A. 2011 Apr 5;108(14):5498-503. (PMID: 21444779)
      Avicenna J Med. 2015 Jan-Mar;5(1):1-5. (PMID: 25625082)
      Proc Natl Acad Sci U S A. 2011 Apr 5;108(14):5488-91. (PMID: 21467227)
      Bioinformatics. 2004 Nov 22;20(17):3045-54. (PMID: 15201187)
      Nature. 2004 Aug 12;430(7001):768-72. (PMID: 15306806)
    • Publication Date:
      Date Created: 20150306 Date Completed: 20160202 Latest Revision: 20231104
    • Publication Date:
      20250114
    • Accession Number:
      PMC4351198
    • Accession Number:
      10.1371/journal.pone.0116781
    • Accession Number:
      25742012