logging in or signing up BaroneComm1 Goldie Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINTLite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 19 Category: Entertainment License: All Rights Reserved Like it (0) Dislike it (0) Added: October 29, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript WP2 - Data Management: WP2 - Data Management L.M.Barone Università di Roma & INFNWP Goals: WP Goals “...to permit the secure access of massive amounts of data...to move and replicate data at high speed from one site to another and to manage the synchronisation of remote data copies” (dal Technical Annex di DataGrid)Keywords: Keywords Automation Caching Generic Interface MetaData Data Mover Replica Manager Security People: People SEDE NOME FTE Bari: L.Silvestris 0.3 G.Zito 0.5 (0.3) Pisa: S.Arezzini 0.3 (0.3) A.Controzzi 0.5 F.Donno 0.2 (0.2) F.Schifano 0.2 Roma1: L.M.Barone 0.3 (0.3) A.Lonardo 0.3 A.Michelotti 0.3 G.Organtini 0.2 D.Rossetti 0.2 (0.2)Deliverables: Deliverables Requirements for Data Location Broker 5/2001 Definition of a metadata syntax 7/2001 Replica Management at file level 12/2001An Example: An Example Ideas for a Replica Manager: Management of production in a distributed environment: Data produced in many sites Data collected in a single reference site Data analyzed in many sites Data sometimes are moved, sometimes may be accessed via network A case study with Objectivity/DB can be extended to any kind of fileCloning federations: Cloning federations Clone FDProductions: Productions GDMP GDMP GDMP GDMPAnalysis: Analysis CERN FD DB1 DB2 DB3 DBn CERN Boot RC1 FD RC1 Boot DBn+1 DBn+m DBn+m+k DBn+m+1 DBn+m DBn+1Logical vs Physical Datasets: Logical vs Physical Datasets Dataset: H 2 Dataset: H 2e Hmm.1.hits.DB Hmm.2.hits.DB Hmm.3.hits.DB Hee.1.hits.DB id=12345 id=12346 id=12347 id=5678 Hee.2.hits.DB id=5679 Hee.3.hits.DB id=5680 pccms1.bo.infn.it::/data1/Hmm1.hits.DB shift23.cern.ch::/db45/Hmm1.hits.DB pccms1.bo.infn.it::/data1/Hmm2.hits.DB shift23.cern.ch::/db45/Hmm2.hits.DB shift23.cern.ch::/db45/Hmm3.hits.DB pccms5.roma1.infn.it::/data/Hee1.hits.DB shift49.cern.ch::/db123/Hee1.hits.DB pccms5.roma1.infn.it::/data/Hee2.hits.DB shift49.cern.ch::/db123/Hee2.hits.DB shift49.cern.ch::/db123/Hee3.hits.DB pccms5.roma1.infn.it::/data/Hee3.hits.DB pccms3.pd.infn.it::/data3/Hmm2.hits.DBLogical vs Physical Datasets: Logical vs Physical Datasets Each dataset is composed by one or more databases datasets are managed by application-sw Each DB is univocally identified by a DBid DBid assignment is a logical-db creation The physical-db is the file zero, one or more instancies The GIS manages the link between a dataset, its logical-dbs and its physical-dbsDatabase creation: CERN FD DB1 DB2 DB3 DB4 0001 DB1.DB shift.cern.ch::/shift/data 0002 DB2.DB shift.cern.ch::/shift/data 0003 DB3.DB shift.cern.ch::/shift/data 0004 DB4.DB pc.rc1.net::/pc/data shift.cern.ch::/shift/data 0005 Database creation 0001 DB1.DB shift.cern.ch::/shift/data 0002 DB2.DB shift.cern.ch::/shift/data 0003 DB3.DB shift.cern.ch::/shift/data 0004 DB4.DB pc.rc1.net::/pc/data shift.cern.ch::/shift/data 0005 DB5.DB pc.rc1.net::/pc.data 0001 DB1.DB shift.cern.ch::/shift/data 0002 DB2.DB shift.cern.ch::/shift/data 0003 DB3.DB shift.cern.ch::/shift/data 0004 DB4.DB pc.rc1.net::/pc/data shift.cern.ch::/shift/data 0005 DB5.db pc.rc1.net::/ps.data shift.cern.ch::/shift/data shift.cern.chReplica Management: 0001 DB1.DB shift.cern.ch::/shift/data pc1.bo.infn.it::/data 0002 DB2.DB shift.cern.ch::/shift/data 0003 DB3.DB shift.cern.ch::/shift/data 0001 DB1.DB shift.cern.ch::/shift/data pc1.bo.infn.it::/data 0002 DB2.DB shift.cern.ch::/shift/data pc1.bo.infn.it::/data 0003 DB3.DB shift.cern.ch::/shift/data Replica Management CERN FD DB1 DB2 DB3 BO Ref shift.cern.ch pc1.bo.infn.it PD Ref pc1.pd.infn.it DB1Example Summary: Example Summary Basic functionalities of a Replica Manager for production will be tested by end of 2000 on CMS production (GDMP) Next comes an Information Server to allow easy synchronization of federations and optimized data access during analysis The same functionalities shown for Objectivity/DB may/should be implemented for other kind of filesConclusions: Conclusions Data Management Tools are needed to face the complexity of new generation experiments (not only LHC) The GRID projects (INFN and EU) are already providing solutions to real life problems Milestones and objectives are well defined (to meet them will not be trivial) You do not have the permission to view this presentation. In order to view it, please contact the author of the presentation.
BaroneComm1 Goldie Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINTLite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 19 Category: Entertainment License: All Rights Reserved Like it (0) Dislike it (0) Added: October 29, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript WP2 - Data Management: WP2 - Data Management L.M.Barone Università di Roma & INFNWP Goals: WP Goals “...to permit the secure access of massive amounts of data...to move and replicate data at high speed from one site to another and to manage the synchronisation of remote data copies” (dal Technical Annex di DataGrid)Keywords: Keywords Automation Caching Generic Interface MetaData Data Mover Replica Manager Security People: People SEDE NOME FTE Bari: L.Silvestris 0.3 G.Zito 0.5 (0.3) Pisa: S.Arezzini 0.3 (0.3) A.Controzzi 0.5 F.Donno 0.2 (0.2) F.Schifano 0.2 Roma1: L.M.Barone 0.3 (0.3) A.Lonardo 0.3 A.Michelotti 0.3 G.Organtini 0.2 D.Rossetti 0.2 (0.2)Deliverables: Deliverables Requirements for Data Location Broker 5/2001 Definition of a metadata syntax 7/2001 Replica Management at file level 12/2001An Example: An Example Ideas for a Replica Manager: Management of production in a distributed environment: Data produced in many sites Data collected in a single reference site Data analyzed in many sites Data sometimes are moved, sometimes may be accessed via network A case study with Objectivity/DB can be extended to any kind of fileCloning federations: Cloning federations Clone FDProductions: Productions GDMP GDMP GDMP GDMPAnalysis: Analysis CERN FD DB1 DB2 DB3 DBn CERN Boot RC1 FD RC1 Boot DBn+1 DBn+m DBn+m+k DBn+m+1 DBn+m DBn+1Logical vs Physical Datasets: Logical vs Physical Datasets Dataset: H 2 Dataset: H 2e Hmm.1.hits.DB Hmm.2.hits.DB Hmm.3.hits.DB Hee.1.hits.DB id=12345 id=12346 id=12347 id=5678 Hee.2.hits.DB id=5679 Hee.3.hits.DB id=5680 pccms1.bo.infn.it::/data1/Hmm1.hits.DB shift23.cern.ch::/db45/Hmm1.hits.DB pccms1.bo.infn.it::/data1/Hmm2.hits.DB shift23.cern.ch::/db45/Hmm2.hits.DB shift23.cern.ch::/db45/Hmm3.hits.DB pccms5.roma1.infn.it::/data/Hee1.hits.DB shift49.cern.ch::/db123/Hee1.hits.DB pccms5.roma1.infn.it::/data/Hee2.hits.DB shift49.cern.ch::/db123/Hee2.hits.DB shift49.cern.ch::/db123/Hee3.hits.DB pccms5.roma1.infn.it::/data/Hee3.hits.DB pccms3.pd.infn.it::/data3/Hmm2.hits.DBLogical vs Physical Datasets: Logical vs Physical Datasets Each dataset is composed by one or more databases datasets are managed by application-sw Each DB is univocally identified by a DBid DBid assignment is a logical-db creation The physical-db is the file zero, one or more instancies The GIS manages the link between a dataset, its logical-dbs and its physical-dbsDatabase creation: CERN FD DB1 DB2 DB3 DB4 0001 DB1.DB shift.cern.ch::/shift/data 0002 DB2.DB shift.cern.ch::/shift/data 0003 DB3.DB shift.cern.ch::/shift/data 0004 DB4.DB pc.rc1.net::/pc/data shift.cern.ch::/shift/data 0005 Database creation 0001 DB1.DB shift.cern.ch::/shift/data 0002 DB2.DB shift.cern.ch::/shift/data 0003 DB3.DB shift.cern.ch::/shift/data 0004 DB4.DB pc.rc1.net::/pc/data shift.cern.ch::/shift/data 0005 DB5.DB pc.rc1.net::/pc.data 0001 DB1.DB shift.cern.ch::/shift/data 0002 DB2.DB shift.cern.ch::/shift/data 0003 DB3.DB shift.cern.ch::/shift/data 0004 DB4.DB pc.rc1.net::/pc/data shift.cern.ch::/shift/data 0005 DB5.db pc.rc1.net::/ps.data shift.cern.ch::/shift/data shift.cern.chReplica Management: 0001 DB1.DB shift.cern.ch::/shift/data pc1.bo.infn.it::/data 0002 DB2.DB shift.cern.ch::/shift/data 0003 DB3.DB shift.cern.ch::/shift/data 0001 DB1.DB shift.cern.ch::/shift/data pc1.bo.infn.it::/data 0002 DB2.DB shift.cern.ch::/shift/data pc1.bo.infn.it::/data 0003 DB3.DB shift.cern.ch::/shift/data Replica Management CERN FD DB1 DB2 DB3 BO Ref shift.cern.ch pc1.bo.infn.it PD Ref pc1.pd.infn.it DB1Example Summary: Example Summary Basic functionalities of a Replica Manager for production will be tested by end of 2000 on CMS production (GDMP) Next comes an Information Server to allow easy synchronization of federations and optimized data access during analysis The same functionalities shown for Objectivity/DB may/should be implemented for other kind of filesConclusions: Conclusions Data Management Tools are needed to face the complexity of new generation experiments (not only LHC) The GRID projects (INFN and EU) are already providing solutions to real life problems Milestones and objectives are well defined (to meet them will not be trivial)