logging in or signing up ipdps 07 kn funnyside Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINT Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 132 Category: Education License: All Rights Reserved Like it (0) Dislike it (0) Added: June 18, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript Volunteer Computing:Planting the FlagDavid P. AndersonSpace Sciences LabU.C. BerkeleyMarch 30, 2007: Volunteer Computing: Planting the Flag David P. Anderson Space Sciences Lab U.C. Berkeley March 30, 2007 Outline: Outline What is volunteer computing? History and current status Some research directions Non-technical problems What is Volunteer computing?: What is Volunteer computing? Projects Volunteers Helps science Involves public in science Early history: Early history GIMPS (1996) George Woltman, Scott Kurowski distributed.net (1997) Adam Beberg, Jim Lawson, ... SETI@home (1999) me, Eric Korpela Folding@Home (2000) Vijay Pande Academic projects, 1996-99: Academic projects, 1996-99 Bayanihan L. Sarmenta, MIT fault-tolerance in presence of hackers 'volunteer computing' Popcorn Noam Nisan, Hebrew Univ. Market-based approach Superweb Charlotte Jet The problem: Java Business attempts: Business attempts Entropia (1998) Popular Power United Devices (2000) Parabon Data Synapse The problem: no customers BOINC (2002): BOINC (2002) Middleware for volunteer computing Open-source (LGPL) Application-driven Goals lots of independent projects support for diverse applications client participation in multiple projects Volunteer computing != Grid computing: Volunteer computing != Grid computing Resource owners Managed systems? Clients behind firewall? anonymous, unaccountable no – need plug andamp; play software yes – pull model yes – software stack requirements OK no – push model identified, accountable ISP bill? yes no ... nor is it 'peer-to-peer computing' Some BOINC-based projects: Some BOINC-based projects Climateprediction.net Oxford; climate change study Einstein@home LIGO; gravitational wave astronomy Rosetta@home U. Washington; protein study SETI@home U.C. Berkeley; SETI LHC@home CERN; accelerator simulation Africa@home STI, U. of Geneva; malaria epidemiology IBM World Community Grid several biomedical applications ...and about 30 others Computing power: Computing power Folding@home: 650 TeraFLOPS 200 from PCs; 50 from GPUs; 400 from PS3! BOINC-based projects: A sampling of research problems: A sampling of research problems Data-intensive computing Low-latency computing Background utility compatibility Credit mechanism Efficient validation Game consoles and graphics chips Simulation Data-intensive computing – client limits: Data-intensive computing – client limits Q = network transfer per GFLOPS/hr SETI@home: Q = 0.1 MB but wider range is OK: Server-side limits: Server-side limits Server Client $ $ Using free networks: Using free networks Client $ Server $ Server $ Server $ Using more free networks: Using more free networks Client $ Server $ Server $ Server $ Client Client Low-latency computing: Low-latency computing VC usually minimizes connection frequency What if you want to do 10,000 1-minute jobs in 6 minutes of wall time? job submission time 2 min deadline 4 min Background utility compatibility: Background utility compatibility Background utilities disk defrag disk indexing virus scanning web pre-fetch disk backup Most run only when computer is idle volunteer computing ==andgt; they never run A) ignore zero-priority CPU activity B) Background manager intelligent decision about when to run various activities Credit mechanism: Credit mechanism Credit is important Currently based on FLOPS Should reflect, e.g. RAM size? Goal: maximize utility to projects Project-specific pricing schedules? Credit for other resources disk usage network transfers Efficient validation: Efficient validation Job output results claimed credit Replicated computing do job N times, require quorum of M Dealing with numerical discrepancies homogeneous replication eliminate discrepancies no replication How to validate provably and efficiently? Game consoles and graphics chips: Game consoles and graphics chips NVIDIA, ATI, Cell 10X CPU and gaining? Folding@home: ATI version Sony PS3 version BOINC and Einstein@home on PS3 How to make this available to other projects? Simulating volunteer computing: Simulating volunteer computing Ad-hoc development of scheduling policies slow, noisy jeopardizes running projects Simulation-based Randamp;D client simulator client scheduling policies Project simulator server scheduling policies Global simulator study data-intensive, low-latency, etc. The hard non-technical problems: The hard non-technical problems How to increase the number of volunteers? currently 1 in 1000 PC owners How to increase the number of projects? currently stuck at about 50 How to get volunteers to diversify? How to attract and retain volunteers?: How to attract and retain volunteers? Retention reminder emails frequent science updates Recruitment Viral 'email a friend', referral reward Organizational World Community Grid: 'partner' program Media coverage need more discoveries Bundling Active hosts: Why aren’t there more projects?: Why aren’t there more projects? Lack of PR among scientists IT antipathy Creating a BOINC project is expensive: Science App development Experiment design Paper writing Research group Software/IT Port/debug apps workflow tools server admin Communications Web site development message board admin public relations Meta-projects: Meta-projects Virtual Campus Supercomputing Center Deployment and publicity: PC labs, staff/faculty desktops students alumni public IBM World Community Grid Science App development Experiment design Paper writing Research groups Software/IT Port/debug apps workflow tools server admin Communications Web site development message board admin public relations Berkeley@home Existing UCB staff Encouraging change: Encouraging change Cross-project credit system encourage competition in total credit, not per-project Account Managers Make it easier to discover/attach/detach projects GridRepublic, BAM! Science Stock Market? encourage participation in new high-potential projects Scientific Mutual Funds? e.g. American Cancer Society BOINC 'portfolio' Conclusion: Conclusion Volunteer computing: a new paradigm distinct research problems, software requirements big accomplishments, potential time to plant the flag! Social impacts Contact me about: Using BOINC Research based on BOINC Organizational use of BOINC davea@ssl.berkeley.edu You do not have the permission to view this presentation. In order to view it, please contact the author of the presentation.
ipdps 07 kn funnyside Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINT Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 132 Category: Education License: All Rights Reserved Like it (0) Dislike it (0) Added: June 18, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript Volunteer Computing:Planting the FlagDavid P. AndersonSpace Sciences LabU.C. BerkeleyMarch 30, 2007: Volunteer Computing: Planting the Flag David P. Anderson Space Sciences Lab U.C. Berkeley March 30, 2007 Outline: Outline What is volunteer computing? History and current status Some research directions Non-technical problems What is Volunteer computing?: What is Volunteer computing? Projects Volunteers Helps science Involves public in science Early history: Early history GIMPS (1996) George Woltman, Scott Kurowski distributed.net (1997) Adam Beberg, Jim Lawson, ... SETI@home (1999) me, Eric Korpela Folding@Home (2000) Vijay Pande Academic projects, 1996-99: Academic projects, 1996-99 Bayanihan L. Sarmenta, MIT fault-tolerance in presence of hackers 'volunteer computing' Popcorn Noam Nisan, Hebrew Univ. Market-based approach Superweb Charlotte Jet The problem: Java Business attempts: Business attempts Entropia (1998) Popular Power United Devices (2000) Parabon Data Synapse The problem: no customers BOINC (2002): BOINC (2002) Middleware for volunteer computing Open-source (LGPL) Application-driven Goals lots of independent projects support for diverse applications client participation in multiple projects Volunteer computing != Grid computing: Volunteer computing != Grid computing Resource owners Managed systems? Clients behind firewall? anonymous, unaccountable no – need plug andamp; play software yes – pull model yes – software stack requirements OK no – push model identified, accountable ISP bill? yes no ... nor is it 'peer-to-peer computing' Some BOINC-based projects: Some BOINC-based projects Climateprediction.net Oxford; climate change study Einstein@home LIGO; gravitational wave astronomy Rosetta@home U. Washington; protein study SETI@home U.C. Berkeley; SETI LHC@home CERN; accelerator simulation Africa@home STI, U. of Geneva; malaria epidemiology IBM World Community Grid several biomedical applications ...and about 30 others Computing power: Computing power Folding@home: 650 TeraFLOPS 200 from PCs; 50 from GPUs; 400 from PS3! BOINC-based projects: A sampling of research problems: A sampling of research problems Data-intensive computing Low-latency computing Background utility compatibility Credit mechanism Efficient validation Game consoles and graphics chips Simulation Data-intensive computing – client limits: Data-intensive computing – client limits Q = network transfer per GFLOPS/hr SETI@home: Q = 0.1 MB but wider range is OK: Server-side limits: Server-side limits Server Client $ $ Using free networks: Using free networks Client $ Server $ Server $ Server $ Using more free networks: Using more free networks Client $ Server $ Server $ Server $ Client Client Low-latency computing: Low-latency computing VC usually minimizes connection frequency What if you want to do 10,000 1-minute jobs in 6 minutes of wall time? job submission time 2 min deadline 4 min Background utility compatibility: Background utility compatibility Background utilities disk defrag disk indexing virus scanning web pre-fetch disk backup Most run only when computer is idle volunteer computing ==andgt; they never run A) ignore zero-priority CPU activity B) Background manager intelligent decision about when to run various activities Credit mechanism: Credit mechanism Credit is important Currently based on FLOPS Should reflect, e.g. RAM size? Goal: maximize utility to projects Project-specific pricing schedules? Credit for other resources disk usage network transfers Efficient validation: Efficient validation Job output results claimed credit Replicated computing do job N times, require quorum of M Dealing with numerical discrepancies homogeneous replication eliminate discrepancies no replication How to validate provably and efficiently? Game consoles and graphics chips: Game consoles and graphics chips NVIDIA, ATI, Cell 10X CPU and gaining? Folding@home: ATI version Sony PS3 version BOINC and Einstein@home on PS3 How to make this available to other projects? Simulating volunteer computing: Simulating volunteer computing Ad-hoc development of scheduling policies slow, noisy jeopardizes running projects Simulation-based Randamp;D client simulator client scheduling policies Project simulator server scheduling policies Global simulator study data-intensive, low-latency, etc. The hard non-technical problems: The hard non-technical problems How to increase the number of volunteers? currently 1 in 1000 PC owners How to increase the number of projects? currently stuck at about 50 How to get volunteers to diversify? How to attract and retain volunteers?: How to attract and retain volunteers? Retention reminder emails frequent science updates Recruitment Viral 'email a friend', referral reward Organizational World Community Grid: 'partner' program Media coverage need more discoveries Bundling Active hosts: Why aren’t there more projects?: Why aren’t there more projects? Lack of PR among scientists IT antipathy Creating a BOINC project is expensive: Science App development Experiment design Paper writing Research group Software/IT Port/debug apps workflow tools server admin Communications Web site development message board admin public relations Meta-projects: Meta-projects Virtual Campus Supercomputing Center Deployment and publicity: PC labs, staff/faculty desktops students alumni public IBM World Community Grid Science App development Experiment design Paper writing Research groups Software/IT Port/debug apps workflow tools server admin Communications Web site development message board admin public relations Berkeley@home Existing UCB staff Encouraging change: Encouraging change Cross-project credit system encourage competition in total credit, not per-project Account Managers Make it easier to discover/attach/detach projects GridRepublic, BAM! Science Stock Market? encourage participation in new high-potential projects Scientific Mutual Funds? e.g. American Cancer Society BOINC 'portfolio' Conclusion: Conclusion Volunteer computing: a new paradigm distinct research problems, software requirements big accomplishments, potential time to plant the flag! Social impacts Contact me about: Using BOINC Research based on BOINC Organizational use of BOINC davea@ssl.berkeley.edu