logging in or signing up ok super comp symp2006 talk voran Malbern Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINT Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 82 Category: Entertainment License: All Rights Reserved Like it (0) Dislike it (0) Added: September 18, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript Blue Gene Experience at the National Center for Atmospheric ResearchOctober 4, 2006: Blue Gene Experience at the National Center for Atmospheric Research October 4, 2006 Theron Voran voran@ucar.edu Why Blue Gene?: Why Blue Gene? Extreme scalability, balanced architecture, simple design Efficient energy usage A change from IBM Power systems at NCAR But familiar Programming model Chip (similar to Power4) Linux on front-end and IO nodes Interesting research platform Outline: Outline System Overview Applications In the Classroom Scheduler Development TeraGrid Integration Other Current Research Activities Frost Fun Facts: Frost Fun Facts Collaborative effort Univ of Colorado at Boulder (CU) NCAR Univ of Colorado at Denver Debuted in June 2005, tied for 58th place on Top500 5.73 Tflops peak – 4.71 sustained 25KW loaded power usage 4 front-ends, 1 service node 6TB usable storage Why is it leaning? Henry Tufo and Rich Loft, with Frost System Internals: System Internals Blue Gene/L system on-a-chip More Details: More Details Chips PPC440 @700MHZ, 2 cores per node 512 MB memory per node Coprocessor vs Virtual Node 1:32 IO to Compute ratio Interconnects 3D Torus (154 MB/s one direction) Tree (354 MB/s) Global Interrupt GigE JTAG/IDO Storage 4 Power5 systems as GPFS cluster NFS export to BGL IO nodes Frost Utilization: Frost Utilization HOMME: HOMME High Order Method Modeling Environment Spectral element dynamical core Proved scalable on other platforms Cubed-sphere topology Space-filling curves HOMME Performance: HOMME Performance Ported in 2004 on BG/L prototype at TJ Watson, with eventual goal of Gordon Bell submission in 2005 Serial and parallel obstacles: SIMD instructions Eager vs Adaptive routing Mapping strategies Result: Good scalability out to 32,768 processors (3 elements per processor) Snake mapping on 8x8x8 3D torus HOMME Scalability on 32 Racks: HOMME Scalability on 32 Racks Other Applications: Other Applications Popular codes on Frost WRF CAM, POP, CICE MPIKAIA EULAG BOB PETSc Used as a scalability test bed, in preparation for runs on 20-rack BG/W system Classroom Access: Classroom Access Henry Tufo’s ‘High Performance Scientific Computing’ course at University of Colorado Let students loose on 2048 processors Thinking BIG Throughput and latency studies Scalability tests - Conway’s Game of Life Final projects Feedback from ‘novice’ HPC users Cobalt: Cobalt Component-Based Lightweight Toolkit Open source resource manager and scheduler Developed by ANL along with NCAR/CU Component Architecture Communication via XML-RPC Process manager, queue manager, scheduler ~3000 lines of python code Manages traditional clusters also http://www.mcs.anl.gov/cobalt Cobalt Architecture: Cobalt Architecture Cobalt Development Areas: Cobalt Development Areas Scheduler improvements Efficient packing Multi-rack challenges Simulation ability Tunable scheduling parameters Visualization Aid in scheduler development Give users (and admins) better understanding of machine allocation Accounting / project management and logging Blue Gene/P TeraGrid integration NCAR joins the TeraGrid, June 2006: NCAR joins the TeraGrid, June 2006 TeraGrid Testbed: TeraGrid Testbed CSS Switch Computational Cluster Teragrid NLR CU experimental Storage Cluster Experimental Environment Production Environment NETS Switch Datagrid NCAR 1GBNet FRGP TeraGrid Activities: TeraGrid Activities Grid-enabling Frost Common TeraGrid Software Stack (CTSS) Grid Resource Allocation Manager (GRAM) and Cobalt interoperability Security infrastructure Storage Cluster 16 OSTs, 50-100 TB usable storage 10G connectivity GPFS-WAN Lustre-WAN Other Current Research Activities: Other Current Research Activities Scalability of CCSM components POP CICE Scalable solver experiments Efficient communication mapping Coupled climate models Petascale parallelism Meta-scheduling Across sites Cobalt vs other schedulers Storage PVFS2 + ZeptoOS Lustre Frost has been a success as a …: Frost has been a success as a … Research experiment Utilization rates Educational tool Classroom Fertile ground for grad students Development platform Petascale problems Systems work Questions? voran@ucar.edu https://wiki.cs.colorado.edu/BlueGeneWiki You do not have the permission to view this presentation. In order to view it, please contact the author of the presentation.
ok super comp symp2006 talk voran Malbern Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINT Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 82 Category: Entertainment License: All Rights Reserved Like it (0) Dislike it (0) Added: September 18, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript Blue Gene Experience at the National Center for Atmospheric ResearchOctober 4, 2006: Blue Gene Experience at the National Center for Atmospheric Research October 4, 2006 Theron Voran voran@ucar.edu Why Blue Gene?: Why Blue Gene? Extreme scalability, balanced architecture, simple design Efficient energy usage A change from IBM Power systems at NCAR But familiar Programming model Chip (similar to Power4) Linux on front-end and IO nodes Interesting research platform Outline: Outline System Overview Applications In the Classroom Scheduler Development TeraGrid Integration Other Current Research Activities Frost Fun Facts: Frost Fun Facts Collaborative effort Univ of Colorado at Boulder (CU) NCAR Univ of Colorado at Denver Debuted in June 2005, tied for 58th place on Top500 5.73 Tflops peak – 4.71 sustained 25KW loaded power usage 4 front-ends, 1 service node 6TB usable storage Why is it leaning? Henry Tufo and Rich Loft, with Frost System Internals: System Internals Blue Gene/L system on-a-chip More Details: More Details Chips PPC440 @700MHZ, 2 cores per node 512 MB memory per node Coprocessor vs Virtual Node 1:32 IO to Compute ratio Interconnects 3D Torus (154 MB/s one direction) Tree (354 MB/s) Global Interrupt GigE JTAG/IDO Storage 4 Power5 systems as GPFS cluster NFS export to BGL IO nodes Frost Utilization: Frost Utilization HOMME: HOMME High Order Method Modeling Environment Spectral element dynamical core Proved scalable on other platforms Cubed-sphere topology Space-filling curves HOMME Performance: HOMME Performance Ported in 2004 on BG/L prototype at TJ Watson, with eventual goal of Gordon Bell submission in 2005 Serial and parallel obstacles: SIMD instructions Eager vs Adaptive routing Mapping strategies Result: Good scalability out to 32,768 processors (3 elements per processor) Snake mapping on 8x8x8 3D torus HOMME Scalability on 32 Racks: HOMME Scalability on 32 Racks Other Applications: Other Applications Popular codes on Frost WRF CAM, POP, CICE MPIKAIA EULAG BOB PETSc Used as a scalability test bed, in preparation for runs on 20-rack BG/W system Classroom Access: Classroom Access Henry Tufo’s ‘High Performance Scientific Computing’ course at University of Colorado Let students loose on 2048 processors Thinking BIG Throughput and latency studies Scalability tests - Conway’s Game of Life Final projects Feedback from ‘novice’ HPC users Cobalt: Cobalt Component-Based Lightweight Toolkit Open source resource manager and scheduler Developed by ANL along with NCAR/CU Component Architecture Communication via XML-RPC Process manager, queue manager, scheduler ~3000 lines of python code Manages traditional clusters also http://www.mcs.anl.gov/cobalt Cobalt Architecture: Cobalt Architecture Cobalt Development Areas: Cobalt Development Areas Scheduler improvements Efficient packing Multi-rack challenges Simulation ability Tunable scheduling parameters Visualization Aid in scheduler development Give users (and admins) better understanding of machine allocation Accounting / project management and logging Blue Gene/P TeraGrid integration NCAR joins the TeraGrid, June 2006: NCAR joins the TeraGrid, June 2006 TeraGrid Testbed: TeraGrid Testbed CSS Switch Computational Cluster Teragrid NLR CU experimental Storage Cluster Experimental Environment Production Environment NETS Switch Datagrid NCAR 1GBNet FRGP TeraGrid Activities: TeraGrid Activities Grid-enabling Frost Common TeraGrid Software Stack (CTSS) Grid Resource Allocation Manager (GRAM) and Cobalt interoperability Security infrastructure Storage Cluster 16 OSTs, 50-100 TB usable storage 10G connectivity GPFS-WAN Lustre-WAN Other Current Research Activities: Other Current Research Activities Scalability of CCSM components POP CICE Scalable solver experiments Efficient communication mapping Coupled climate models Petascale parallelism Meta-scheduling Across sites Cobalt vs other schedulers Storage PVFS2 + ZeptoOS Lustre Frost has been a success as a …: Frost has been a success as a … Research experiment Utilization rates Educational tool Classroom Fertile ground for grad students Development platform Petascale problems Systems work Questions? voran@ucar.edu https://wiki.cs.colorado.edu/BlueGeneWiki