logging in or signing up Production Science Data Groups aSGuest904 Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINT lite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 22 Category: Science & Tech.. License: All Rights Reserved Like it (0) Dislike it (0) Added: October 14, 2008 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript PIPE Dreams : PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003 Abstract : Abstract The vision of science grids allocating resources to analyze huge quantities of HENP data clearly depends on reliable network performance. Tools developed at SLAC in conjunction with the Internet2 PIPES project will help to ensure this. In this talk, these tools will be discussed and the procedure for publishing performance data, in particular using the Globus toolkit's MDS and web services will be reviewed. The subsequent analysis and trouble-shooting methodology will be discussed with real world examples from the particle physics data grid (PPDG) and the European data grid (EDG). Overview : Overview What is the problem ? What is PIPES ? Network performance monitoring Problem identification Network Monitoring for the Grid : Resource Broker Farm Farm Farm Data Data Data requestor The Network Network Monitoring for the Grid The Data Grid consists of many components that must interoperate requestor Allocate Resources : Resource Broker Farm Farm Farm Data Data Data requestor The Network Allocate Resources The resource broker must be fully informed Measurement is required ! requestor 12% pkt loss OC48 80% Utilization What is PIPES ? : What is PIPES ? Internet2 End-to-end performance initiative PI Performance Evaluation System (PIPES) PIPES Monitoring Platform (PMP) Overlap with goals of HENP Tremendous resources IEPM-BW : IEPM-BW Package developed at SLAC Measurement Engine Iperf, bbftp, bbcp, ping, traceroute Abwe, owamp, udpmon, gridftp Job Manager Data Storage and data server Analysis Engine Slide 8: SNV SLAC CHI ESnet NY Stanford CalREN NERSC LANL JLAB TRIUMF KEK Abilene SLAC SNV FNAL ANL NIKHEF CERN IN2P3 CERN CALTECH SDSC BNL JAnet HSTN SEA ATL CLV IPLS RAL UCL UManc DL NNW NY Rice UTDallas NCSA UMich I2 SOX UFL APAN RIKEN INFN-Roma INFN-Milan CESnet APAN Geant EDG PPDG/GriPhyN Monitoring Site ORNL Stanford UTAH DNVR ORNL NASA WASH Imperial INFN-Padua Slide 9: SLAC Manchester Bristol Dresden IN2P3 RAL Stanford Calren Abilene Renater DFN Janet NNW TVN SWERN ESnet BaBar Grid Geant 622Mbps 2.5 Gbps 1 Gbps 10 Gbps Problem Identification : Problem Identification Typical Scenario User complains file transfer is slow Net admin runs ping, traceroute, iperf test Complain to upstream provider Proactive What do we mean by throughput? How do we know there was a performance hit? Our approach is diurnal changes Alarms : Alarms Too much to keep track of Rather not wait for complaints Automated Alarms Rolling average à la RIPE-TT May not be the best approach AMP Automated Detection System Limitations : Limitations Could be over an hour before alarm is generated More frequent measurements impact the network and measurements overlap Low impact tools allow finer grained measurement Use NWS multi-variate method Use SCIDAC ABwE tool Use PingER, OWAMP Publishing : Publishing Many monitoring projects, publish data to allow them to inter-operate MDS EDG NM Schema Web Services GLUE NE Schema GGF NMWG Hierarchy Doc Tools Doc ./get_data 2003 3 18 6 1 41 1.61 1.601 1.62 0 Net Rat : Net Rat Alarm System Multiple tools Multiple measurement points Trigger further measurements Cross reference off site stats Informant database No measurement is ‘authoritative’ Cannot even believe a measurement Log : Log 03/20/2003 20:13:46 ALARM pcgiga throughput=305.224 ctresh=512.95 athresh=312.91 03/20/2003 20:13:48 TRACE no change in route detected 03/20/2003 20:16:07 CALM Throughput within acceptable limits. ALARM CANCELLED Toward a Monitoring Infrastructure : Toward a Monitoring Infrastructure MAGGIE Measurement and Analysis package built on NIMI/Akenti EDEE production-quality Data Grid for Europe More Information : More Information IEPM Home Page IEPM-BW I2 E2E and PIPES RIPE-TT AMP Automated Event Detection NWS ABWE End : End This talk made possible by the IEPM team at SLAC (Les Cottrell, Connie Logg, Jiri Navratil, Jerrod Williams, Fabrizio Coccetti), and the many developers and maintainers around the world. You do not have the permission to view this presentation. In order to view it, please contact the author of the presentation.
Production Science Data Groups aSGuest904 Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINT lite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 22 Category: Science & Tech.. License: All Rights Reserved Like it (0) Dislike it (0) Added: October 14, 2008 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript PIPE Dreams : PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003 Abstract : Abstract The vision of science grids allocating resources to analyze huge quantities of HENP data clearly depends on reliable network performance. Tools developed at SLAC in conjunction with the Internet2 PIPES project will help to ensure this. In this talk, these tools will be discussed and the procedure for publishing performance data, in particular using the Globus toolkit's MDS and web services will be reviewed. The subsequent analysis and trouble-shooting methodology will be discussed with real world examples from the particle physics data grid (PPDG) and the European data grid (EDG). Overview : Overview What is the problem ? What is PIPES ? Network performance monitoring Problem identification Network Monitoring for the Grid : Resource Broker Farm Farm Farm Data Data Data requestor The Network Network Monitoring for the Grid The Data Grid consists of many components that must interoperate requestor Allocate Resources : Resource Broker Farm Farm Farm Data Data Data requestor The Network Allocate Resources The resource broker must be fully informed Measurement is required ! requestor 12% pkt loss OC48 80% Utilization What is PIPES ? : What is PIPES ? Internet2 End-to-end performance initiative PI Performance Evaluation System (PIPES) PIPES Monitoring Platform (PMP) Overlap with goals of HENP Tremendous resources IEPM-BW : IEPM-BW Package developed at SLAC Measurement Engine Iperf, bbftp, bbcp, ping, traceroute Abwe, owamp, udpmon, gridftp Job Manager Data Storage and data server Analysis Engine Slide 8: SNV SLAC CHI ESnet NY Stanford CalREN NERSC LANL JLAB TRIUMF KEK Abilene SLAC SNV FNAL ANL NIKHEF CERN IN2P3 CERN CALTECH SDSC BNL JAnet HSTN SEA ATL CLV IPLS RAL UCL UManc DL NNW NY Rice UTDallas NCSA UMich I2 SOX UFL APAN RIKEN INFN-Roma INFN-Milan CESnet APAN Geant EDG PPDG/GriPhyN Monitoring Site ORNL Stanford UTAH DNVR ORNL NASA WASH Imperial INFN-Padua Slide 9: SLAC Manchester Bristol Dresden IN2P3 RAL Stanford Calren Abilene Renater DFN Janet NNW TVN SWERN ESnet BaBar Grid Geant 622Mbps 2.5 Gbps 1 Gbps 10 Gbps Problem Identification : Problem Identification Typical Scenario User complains file transfer is slow Net admin runs ping, traceroute, iperf test Complain to upstream provider Proactive What do we mean by throughput? How do we know there was a performance hit? Our approach is diurnal changes Alarms : Alarms Too much to keep track of Rather not wait for complaints Automated Alarms Rolling average à la RIPE-TT May not be the best approach AMP Automated Detection System Limitations : Limitations Could be over an hour before alarm is generated More frequent measurements impact the network and measurements overlap Low impact tools allow finer grained measurement Use NWS multi-variate method Use SCIDAC ABwE tool Use PingER, OWAMP Publishing : Publishing Many monitoring projects, publish data to allow them to inter-operate MDS EDG NM Schema Web Services GLUE NE Schema GGF NMWG Hierarchy Doc Tools Doc ./get_data 2003 3 18 6 1 41 1.61 1.601 1.62 0 Net Rat : Net Rat Alarm System Multiple tools Multiple measurement points Trigger further measurements Cross reference off site stats Informant database No measurement is ‘authoritative’ Cannot even believe a measurement Log : Log 03/20/2003 20:13:46 ALARM pcgiga throughput=305.224 ctresh=512.95 athresh=312.91 03/20/2003 20:13:48 TRACE no change in route detected 03/20/2003 20:16:07 CALM Throughput within acceptable limits. ALARM CANCELLED Toward a Monitoring Infrastructure : Toward a Monitoring Infrastructure MAGGIE Measurement and Analysis package built on NIMI/Akenti EDEE production-quality Data Grid for Europe More Information : More Information IEPM Home Page IEPM-BW I2 E2E and PIPES RIPE-TT AMP Automated Event Detection NWS ABWE End : End This talk made possible by the IEPM team at SLAC (Les Cottrell, Connie Logg, Jiri Navratil, Jerrod Williams, Fabrizio Coccetti), and the many developers and maintainers around the world.