Folding@home
From Wikipedia, the free encyclopedia
Folding@Home | |
![]() |
|
The recently released PlayStation 3 Folding@home client displays a 3D model of the protein being simulated. |
|
Author: | Vijay Pande |
---|---|
Developer: | Stanford University / Pande Group |
Latest release: | 5.03 (Windows), 5.02 (Linux), 5.02 (Mac OS X), 1.0 (Playstation 3) |
Preview release: | 5.04 (Windows), 5.04 (Linux), 5.91beta4 (GPU), 5.91 (Windows-SMP), 5.91 (Mac OS X-SMP), 5.91 (Linux-SMP) / 2006-12-01 |
Platform: | Cross-platform |
Use: | Distributed computing |
License: | Proprietary [1] |
Website: | folding.stanford.edu |
Folding@home (also known as FAH or F@H) is a distributed computing project designed to perform computationally intensive simulations of protein folding and other molecular dynamics simulations. It was launched on October 1, 2000, and is currently managed by the Pande Group, within Stanford University's Chemistry department, under the supervision of Professor Vijay S. Pande. F@H is one of the largest distributed computing projects.[1] The goal of the project is "to understand protein folding, misfolding, and related diseases."[2]
Accurate simulations of protein folding and misfolding enable the scientific community to better understand the development of many diseases, including Alzheimer's disease, BSE (mad cow disease), Cancer, Huntington's Disease, Cystic Fibrosis and other aggregation related diseases. [2] More fundamentally, understanding the process of protein folding — how biological molecules assemble themselves into a functional state — is one of the outstanding problems of molecular biology. So far, the F@H project has successfully simulated folding in the 5-10 microsecond range — a time scale thousands of times longer than was previously thought possible.[3]
As of March 30, 2007, forty-nine scientific research papers have been published using the project's work.[4] A University of Illinois at Urbana-Champaign report dated October 22, 2002 states that F@H distributed simulations of protein folding are demonstrably accurate.[5]
Contents |
[edit] How it works
Folding@home does not rely on powerful supercomputers for its data processing; instead, the primary contributors to the F@H project are many hundreds of thousands of personal computer users who have installed a small client program. The client will, at the user's choice, run in the background utilizing otherwise unused CPU power, or run as a screensaver only while the user is away. In most modern personal computers, the CPU is rarely used to its full capacity at all times; the F@H client takes advantage of this unused processing power.
The F@H client periodically connects to a server to retrieve "work units," which are packets of data upon which to perform calculations. Each completed work unit is then sent back to the server. As data integrity is a major concern for all distributed computing projects, all work units are validated through the use of a 2048 bit digital signature.
The F@H client utilizes modified versions of four molecular simulation programs for calculation: TINKER, GROMACS, AMBER, and CPMD.[6]
Contributors to F@H may have user names used to keep track of their contributions. Each user may be running the client on one or more CPUs; for example, a user with two computers could run the client on both of them. Users may also contribute under one or more team names; many different users may join together to form a team. Contributors are assigned a score indicating the number and difficulty of completed work units. Rankings and other statistics are posted to the F@H website.
[edit] Participation
Shortly after breaking the 200,000 active CPU count on September 20, 2005, the F@H project celebrated its fifth anniversary on October 1, 2005.
As of March 25, 2007 the peak speed of the project overall has reached over 990 TFLOPS. [7]
[edit] Google & Folding@home
There used to be cooperation between Folding@home and Google Labs. This came in the form of Google Compute. Google Compute supported F@H during its early stage — when F@H had ~10,000 active CPUs. At that time, a boost of 20,000 machines was very significant. Now, the F@H client is considerably more mature than it was 5 years ago, and the project has a large number of active CPUs. The number of new clients joining Google Compute was very low (most people opted for the F@H client instead) and so it didn't make sense to continue it. Also, the Google Compute clients had certain limits: they could only run the TINKER core, limited naming, and team options. F@H is no longer supported on Google Toolbar, and even the old Google Toolbar client will not work.[8]
[edit] High performance platforms
[edit] Graphical processing units
Current research is aimed at accelerating computational power by utilizing a computer's graphics processing unit (GPU) in addition to the Central processing unit (CPU). News about the progress of porting Folding@home onto GPUs can be found in the "High performance client FAQ" section of the F@H FAQ pages.[9] Recent test data indicate performance gains of up to 40x that of an Intel Pentium 4 CPU are possible. (Note: this performance varies with different GPUs). Stanford has recently cited further advances with the high performance client and released a public, beta trial at the end of September 2006. However, this trial is specific to ATI Technologies' GPUs due to the performance characteristics of the processors for this application.[10]
As of October 2, 2006, the F@H GPU client has been released into a public beta test. After 9 days of processing from the Beta client the F@H project had received 44 teraFLOPS of computational performance from just 450 X1900 GPUs, averaging at over 70x the performance of current CPU submissions.[1]
[edit] PlayStation 3
Stanford announced in August 2006 that a folding client will be available to run on the Sony PlayStation 3.[11] The intent is that gamers be able to contribute to the project by merely "contributing electricity," leaving their PlayStation 3 consoles running the client while not playing games. PS3 firmware version 1.60 (released on Thursday, March 22) allows for F@H software, a 51MB download, to be used on the PS3. [12] A peak output of the project at 990 teraFLOPS was achieved on 25 March, 2007, at which time the number of FLOPS from each PS3 as reported by Stanford fell, reducing the overall speed rating of those machines by 50%. This had the effect of bumping down the overall project speed to the mid 700 range and increasing the number of active PS3's required to achieve a petaFLOP level to around 60,000. Lately, the console accounts for about 3/5's of all teraFLOPS.
[edit] Multi-core processing client
As more modern CPUs are being released the migration to multiple cores is becoming more adopted by the public, the Pande Group is finally adding the symmetric multiprocessing (SMP) support to the Folding@home client as well in hopes to capture the additional processing power. On November 13, 2006, the beta SMP Folding@home clients for x86-64 Linux and x86 Mac OS X have been released. The beta win32 SMP Folding@home client is out as well, but there is no news about x86-32 Linux SMP client.[13]
[edit] Folding@home teams
A typical Folding@home user, running the client on a single PC, will likely not be ranked high on the list of contributors. However, if the user were to join a team, they would add the points they receive to a larger collective. Teams work by using the combined score of all their members. Thus, teams are ranked much higher than individual submitters. Rivalries between teams create friendly competition that benefits the folding community. Many teams publish their own stats, so members can have intra-team competitions for top spots. Teams offer no real benefits other than ones of self-gratification.[14]
[edit] The future of Folding@home
With a published history of 49 peer-reviewed papers (March 2007) from the Pandegroup at Stanford, and 30 more prior to that, it seems that the project has demonstrable scientific merit.
The science behind folding is continually evolving, and more people want to do research in this field. The PI list for the folding project changes from year to year as students complete their study and new recruits take over.
The folding project is always looking to push the boundaries of technology too, provided that resulting increase in computing power is worth the initial investment.
The existing processing cores are updated on a regular basis in order to incorporate the latest features of their respective simulation methods.
Vijay himself has already said that the research he and his group are doing is still limited by the raw CPU power available to them:
“ | We have specific plans ready to put into action once we reach certain CPU levels, up to 1M[illion] CPUs. Once we start to get close to 1M active CPUs, we will sit down and make plans for going beyond that. Our research is very computer limited. | ” |
[edit] See also
[edit] Notes and references
- ^ a b Client Statistics by OS. Folding@home distributed computing. Stanford University (2006-11-12 (updated automatically)). Retrieved on 2006-11-12.
- ^ Vijay Pande (2006). Folding@home distributed computing home page. Stanford University. Retrieved on 2006-11-12.
- ^ Validity of Folding@home (Blog). Folding@home support forum. Stanford University. Retrieved on 2006-11-12.
- ^ Vijay Pande (2007). Recent Pande Group research papers. Folding@home distributed computing. Stanford University. Retrieved on 2007-03-30.
- ^ C. Snow, H. Nguyen, V. S. Pande, and M. Gruebele. (2002). "Absolute comparison of simulated and experimental protein-folding dynamics". Nature 420 (6911): 102–106. PMID 12422224.
- ^ Vijay Pande (2005-10-16). Folding@Home with QMD core FAQ (FAQ). Stanford University. Retrieved on 2006-12-03. The site indicates that Folding@home uses a modification of CPMD allowing it to run on the supercluster environment.
- ^ http://fah-web.stanford.edu/cgi-bin/main.py?qtype=osstats
- ^ What is the state of Google Compute client? (Blog). Folding@home support forum. Stanford University. Retrieved on 2006-11-12.
- ^ Folding@home high performance client FAQ. FAQs on new hardware, Folding@home. Vijay Pande and Stanford University. Retrieved on 2006-12-01.
- ^ Vijay Pande (2006-11-06). Folding@home on ATI GPU FAQ. Stanford University. Retrieved on 2006-11-13.
- ^ Vijay Pande (2006-10-22). PS3 FAQ. Stanford University. Retrieved on 2006-11-13.
- ^ http://fah-web.stanford.edu/cgi-bin/main.py?qtype=osstats
- ^ Vijay Pande (2006-11-13). Folding@home SMP Client FAQ. Stanford University. Retrieved on 2006-11-13.
- ^ Folding-community: why have teams?
- M. R. Shirts and V. S. Pande. (2000). "Screen Savers of the World, Unite!". Science 290: 1903–1904.
- C. Snow, H. Nguyen, V. S. Pande, and M. Gruebele. (2002). "Folding of a bba protein: simulation and theory.". Nature 420: 102–106.
- C. D. Snow, E. J. Sorin, Y. M. Rhee, and V. S. Pande. (2005). "How well can simulation predict protein folding kinetics and thermodynamics?". Annual Reviews of Biophysics 34: 43–69.
- L. T. Chong, C. D. Snow, Y. M. Rhee, and V. S. Pande. (2004). "Dimerization of the p53 oligomerization domain: Identification of a folding nucleus by molecular dynamics simulations.". Journal of Molecular Biology 345: 869–78.
- I. Suydam, C. D. Snow, V. S. Pande and S. G. Boxer. (2006). "Electric Fields at the Active Site of an Enzyme: Direct Comparison of Experiment with Theory.". Science in press.
- Folding-community: How can you tell the true nature of a Work Unit
- Folding-community: Vijay - No need to report EUEs