• Medientyp: E-Artikel
  • Titel: An approach for pipelining nested collections in scientific workflows
  • Beteiligte: McPhillips, Timothy M.; Bowers, Shawn
  • Erschienen: Association for Computing Machinery (ACM), 2005
  • Erschienen in: ACM SIGMOD Record, 34 (2005) 3, Seite 12-17
  • Sprache: Englisch
  • DOI: 10.1145/1084805.1084809
  • ISSN: 0163-5808
  • Entstehung:
  • Anmerkungen:
  • Beschreibung: We describe an approach for pipelining nested data collections in scientific workflows. Our approach logically delimits arbitrarily nested collections of data tokens using special, paired control tokens inserted into token streams, and provides workflow components with high-level operations for managing these collections. Our framework provides new capabilities for: (1) concurrent operation on collections; (2) on-the-fly customization of workflow component behavior; (3) improved handling of exceptions and faults; and (4) transparent passing of provenance and metadata within token streams. We demonstrate our approach using a workflow for inferring phylogenetic trees. We also describe future extensions to support richer typing mechanisms for facilitating sharing and reuse of workflow components between disciplines. This work represents a step towards our larger goal of exploiting collection-oriented dataflow programming as a new paradigm for scientific workflow systems, an approach we believe will significantly reduce the complexity of creating and reusing workflows and workflow components.
  • Zugangsstatus: Freier Zugang