Presentation Transcript

Building a Community Cyberinfrastructure to Support Marine Microbial Ecology Metagenomics
Invited Talk, 2006 Synthetic Biology Symposium
Aliso Creek Inn, Laguna Beach, CA
September 15, 2006
Dr. Larry Smarr
Director, California Institute for Telecommunications and Information Technology
Harry E. Gruber Professor, Dept. of Computer Science and Engineering, Jacobs School of Engineering, UCSD

Calit2 Brings Computer Scientists and Engineers Together with Biomedical Researchers
Some Areas of Concentration:
  Metagenomics
  Genomic Analysis of Organisms
  Evolution of Genomes
  Cancer Genomics
  Human Genomic Variation & Disease
  Proteomics
  Mitochondrial Evolution
  Computational Biology & Bioinformatics
  Information Theory & Biological Systems
UC San Diego, UC Irvine
1200 Researchers in Two Buildings
www.calit2.net

Most of Evolutionary Time Was in the Microbial World
Tree of Life Derived from 16S rRNA Sequences
Source: Carl Woese, et al.

Microbial Genomics Lets Us Look Back Nearly 4 Billion Years in the Evolution of Life
Falkowski and Vargas, Science 304 (5667), 2004

Moore Microbial Genome Sequencing Project: Selected Microbes Throughout the World’s Oceans
Microbes Nominated by Leading Ocean Microbial Biologists
www.moore.org/microgenome/worldmap.asp

Moore Foundation Funded the Venter Institute to Provide the Full Genome Sequence of 150 Marine Microbes
www.moore.org/microgenome/trees_main.asp

Moore Microbial Genome Sequencing Project: Cyanobacteria Being Sequenced by Venter Institute

Full Genome Sequencing is Exploding: Most Sequenced Genomes are Bacterial
Chart labels: 55 Metagenomes; First Genome 1995; 6 Genomes/Year 2000; Moore 155 In Here
www.genomesonline.org

Microbial Metagenomics is a Rapidly Emerging Field of Research
“Despite their ubiquity, relatively little is known about the majority of environmental microorganisms, largely because of their resistance to culture under standard laboratory conditions.”
“The application of high-throughput shotgun sequencing environmental samples has recently provided global views of those communities not obtainable from 16S rRNA or BAC clone–sequencing surveys.”
Source: “Comparative Metagenomics of Microbial Communities,” Susannah Green Tringe, Christian von Mering, Arthur Kobayashi, Asaf A. Salamov, Kevin Chen, Hwai W. Chang, Mircea Podar, Jay M. Short, Eric J. Mathur, John C. Detter, Peer Bork, Philip Hugenholtz, Edward M. Rubin, Science, 22 April 2005

The Sargasso Sea Experiment: The Power of Environmental Metagenomics
Yielded a Total of Over 1 Billion Base Pairs of Non-Redundant Sequence
Displayed the Gene Content, Diversity, & Relative Abundance of the Organisms
Sequences from at Least 1800 Genomic Species, Including 148 Previously Unknown
Identified over 1.2 Million Unknown Genes
Image: MODIS-Aqua satellite image of ocean chlorophyll in the Sargasso Sea grid about the BATS site from 22 February 2003
Source: J. Craig Venter, et al., Science, 2 April 2004, Vol. 304, pp. 66-74

Marine Genome Sequencing Project – Measuring the Genetic Diversity of Ocean Microbes
Sorcerer II Data Will Double Number of Proteins in GenBank!

GOS Sequences are Largely Bacterial
~3 Million Previously Known Sequences
~5.6 Million GOS Sequences
Source: Shibu Yooseph, et al. (PLOS Biology, in press 2006)

GOS Analysis -- Protein Families in Nature Have Been Poorly Explored Thus Far
Novel Sequence Similarity Clustering Process Predicts Proteins and Groups Related Sequences Into Clusters (Families)
GOS Proteins Increase Size / Diversity of Many Protein Families
1,700 Novel GOS-Only Clusters Identified (>20 per Cluster): 10% of 17,000 Clusters
Chart labels: NCBI_nr; GOS + NCBI_nr + Ensembl + TIGR Gene Indices + Prokaryotic Genomes
Source: Shibu Yooseph, Granger Sutton, JCVI

Current Universe of Medium/Large Protein Families
17,067 Protein Family Clusters
Protein Families Conserved Across Tree of Life
Protein Families Unique to GOS
Source: Shibu Yooseph, et al. (PLOS Biology, in press 2006)

Metagenomic Data Sets Are Rapidly Being Accumulated
“A majority of the bacterial sequences corresponded to uncultivated species and novel microorganisms.”
“We discovered significant inter-subject variability.”
“Characterization of this immensely diverse ecosystem is the first step in elucidating its role in health and disease.”
395 Phylotypes
Source: “Diversity of the Human Intestinal Microbial Flora,” Paul B. Eckburg, et al., Science (10 June 2005)

Microbes Form the Base of the Living World
White Filamentous Bacteria on 'Pill Bug' Outer Carapace
High Definition Still Frame of Hydrothermal Vent Ecology 2.3 Km Deep
Source: John Delaney and Research Channel, U Washington

Slide 17
PI Larry Smarr
Announced January 17, 2006
$24.5M Over Seven Years

Paul Gilna Has Been Recruited from Los Alamos to Become Calit2’s Executive Director of CAMERA
Formerly:
  Director of the Department of Energy’s Joint Genome Institute (JGI) Operations at Los Alamos National Laboratory (LANL)
  Group Leader of Genomic Science and Computational Biology in LANL’s Bioscience Division
JGI: A $70-million-per-Year Collaboration of Lawrence Berkeley, Lawrence Livermore, Los Alamos, Oak Ridge, and Pacific Northwest and the Stanford Human Genome Center, Working at the Frontiers of Genome Sequencing and Biosciences

National Lambda Rail (NLR) and TeraGrid Provide Cyberinfrastructure Backbone for U.S. Researchers
NLR 4 x 10Gb Lambdas Initially, Capable of 40 x 10Gb Wavelengths at Buildout
NSF’s TeraGrid Has 4 x 10Gb Lambda Backbone
Links Two Dozen State and Regional Optical Networks
DOE, NSF, & NASA Using NLR
Map cities: San Francisco, Pittsburgh, Cleveland, San Diego, Los Angeles, Portland, Seattle, Pensacola, Baton Rouge, Houston, San Antonio, Las Cruces / El Paso, Phoenix, New York City, Washington, DC, Raleigh, Jacksonville, Dallas, Tulsa, Atlanta, Kansas City, Denver, Ogden / Salt Lake City, Boise, Albuquerque, Chicago
Map labels: UC-TeraGrid, UIC/NW-Starlight, International Collaborators

Calit2’s Direct Access Core Architecture Will Create Next Generation Metagenomics Server
Diagram labels: Traditional User, Request, Response, + Web Services
Data sources:
  Sargasso Sea Data
  Sorcerer II Expedition (GOS)
  JGI Community Sequencing Project
  Moore Marine Microbial Project
  NASA and NOAA Satellite Data
  Community Microbial Metagenomics Data
Source: Phil Papadopoulos, SDSC, Calit2

The Future Home of the Moore Foundation Funded Marine Microbial Ecology Metagenomics Complex
First Implementation of the CAMERA Complex
Major Buildout of Calit2 Server Room Underway
Photo Courtesy Joe Keefe, Calit2

Analysis Data Sets, Data Services, Tools, and Workflows
Assemblies of Metagenomic Data
  e.g., GOS, JGI CSP
Annotations
  Genomic and Metagenomic Data
“All-against-all” Alignments of ORFs
  Updated Periodically
Gene Clusters and Associated Data
  Profiles, Multiple-Sequence Alignments, HMMs, Phylogenies, Peptide Sequences
Data Services
  ‘Raw’ and Specialized Analysis Data
  Rich Query Facilities
Tools and Workflows
  Navigate and Sift Raw and Analysis Data
  Publish Workflows and Develop New Ones
  Prioritize Features via Dialogue with Community
Source: Saul Kravitz, Director of Software Engineering, J. Craig Venter Institute

OptIPortal – Termination Device for the Dedicated Gigabit/sec Lightpaths
Collaborative Analysis of Large Scale Images of Cancer Cells
Integration of High Definition Video Streams with Large Scale Image Display Walls
Photo Source: David Lee, Mark Ellisman, NCMIR, UCSD

Emerging OptIPortal Sites on the National LambdaRail
Dedicated 10 Gbps CAVEWave Connects San Diego to Seattle to Chicago to Washington D.C.
Map labels: NEW! (twice), SunLight, CICESE, UW, JCVI, MIT, SIO, UCSD, SDSU, UIC EVL, UCI, OptIPortals

CAMERA Outreach Modes
Scientific Advisory Board
Early Adopters – OptIPortal End Points
Targeted Workshops
User Forums
User Software Testing
Viz Tool Brainstorming
Presentations at Scientific Meetings
  e.g., Demonstration Booth at JCVI Genomes, Medicine, and the Environment Conference, October 2006
Partnerships With Metagenomics Projects
  e.g., DoE’s Joint Genome Institute (JGI)
Training and User Services Team

Timeline: Sprint and Marathon
Sprint
  Release 0.0: April 2006
    Test Cluster for UCSD/JCVI Collaboration
  Release 1.0: Late Fall 2006
    Initial Data and Core Tools Release
    Supports Publication of GOS Papers
Marathon
  Release 2.0: Fall 2007
    Additional/Improved Tools & Better Usability
  Beyond 2.0
    Move Towards Semantic DB
    Additional Tools Based on Community Feedback
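
Worked check of the quoted figures (not part of the original talk)
The slides quote several numbers that can be verified with simple arithmetic: the NLR and TeraGrid lambda counts, and the share of protein family clusters that are GOS-only. The sketch below assumes that "10Gb" on the network slide means 10 gigabits per second per wavelength, which is consistent with the "10 Gbps CAVEWave" and "Gigabit/sec Lightpaths" wording elsewhere in the deck. All variable and function names are illustrative and do not come from the presentation.

```python
# Figures as quoted on the slides; names here are hypothetical, for illustration.
LAMBDA_GBPS = 10             # assumed: one wavelength ("10Gb") carries 10 Gb/s
NLR_INITIAL_LAMBDAS = 4      # "NLR 4 x 10Gb Lambdas Initially"
NLR_BUILDOUT_LAMBDAS = 40    # "Capable of 40 x 10Gb wavelengths at Buildout"
TERAGRID_LAMBDAS = 4         # "TeraGrid Has 4 x 10Gb Lambda Backbone"

NOVEL_GOS_CLUSTERS = 1_700   # "1,700 Novel GOS-Only Clusters"
CLUSTERS_ROUNDED = 17_000    # "10% of 17,000 Clusters"
CLUSTERS_EXACT = 17_067      # "17,067 Protein Family Clusters"


def aggregate_gbps(n_lambdas: int, per_lambda_gbps: int = LAMBDA_GBPS) -> int:
    """Total capacity in Gb/s for n parallel wavelengths."""
    return n_lambdas * per_lambda_gbps


def percent(part: int, whole: int) -> float:
    """Share of `part` in `whole`, as a percentage."""
    return 100.0 * part / whole


nlr_initial = aggregate_gbps(NLR_INITIAL_LAMBDAS)
nlr_buildout = aggregate_gbps(NLR_BUILDOUT_LAMBDAS)
teragrid = aggregate_gbps(TERAGRID_LAMBDAS)
share_rounded = percent(NOVEL_GOS_CLUSTERS, CLUSTERS_ROUNDED)
share_exact = percent(NOVEL_GOS_CLUSTERS, CLUSTERS_EXACT)

print(f"NLR initial capacity:  {nlr_initial} Gb/s")
print(f"NLR buildout capacity: {nlr_buildout} Gb/s")
print(f"TeraGrid backbone:     {teragrid} Gb/s")
print(f"GOS-only share (17,000 clusters): {share_rounded:.1f}%")
print(f"GOS-only share (17,067 clusters): {share_exact:.2f}%")
```

Under that assumption the NLR figure grows tenfold between the initial deployment and buildout, and the "10%" on the GOS analysis slide is the rounded form of 1,700 out of 17,067 clusters.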