A ‘conductor’ that ensures simultaneous processing tasks keep time could dramatically increase the efficiency of ‘cloud’ simulations.
In large-scale simulations that involve simultaneous computational tasks on distributed computers, the overall speed of the simulation is limited by the slowest link. By adaptively redistributing computational resources in real-time according to workload, a Singapore-based research team have shown how to overcome this ‘slowest link’ limitation.
This approach could dramatically improve the speed and efficiency of simulations conducted across many computers — also called ‘cloud’ simulations.
“The problem of workload imbalance is very common in large-scale simulations, which involve a group of parallel distributed computers or ‘components’ that need to synchronize with each other to ensure that all simulation events are executed in time stamp order,” explains research leader Zengxiang Li, from the A*STAR Institute of High Performance Computing.
Parallel computing simulations involve a large number of events that must occur in order. These events are assigned to multiple parallel computing ‘nodes’ for simultaneous computation. When an event is processed, new events may be generated and inserted into the event processing queue. It is wasteful to let expensive computational resources lie idle waiting for work, so parallel processing schemes often allow each node to process events sequentially without waiting for events from other nodes.
The problem is that if events from one node are late, the other nodes proceeding with their ‘optimistic’ execution of the next event will need to discard their extra work and rollback to where the late node left off. “The entire simulation execution is held back by the slowest components,” says Li, “while faster components risk wasting time and resources on overoptimistic execution and execution rollbacks.”
To improve the efficiency of such simulations, Li and his colleagues developed a resource-conducting scheme called Adaptive Resource Provisioning Mechanism in Virtual Execution Environments, or ArmVee. This scheme sits transparently as middleware in the simulation environment to monitor workloads and task completion speeds on each node in real-time. ArmVee then dynamically reallocates resources, such as memory and processing cycles, to speed up the slowest links.
“We use a self-adaptive auto-regressive-moving-average model — commonly used in control theory — to capture the relationship between simulation performance and resources,” says Li. “This allows ArmVee to predict the dynamically changing simulation workload and to align the execution speeds of simulation components proactively so that each advances in simulation time with comparable speed.”
Importantly, ArmVee can be used transparently in standard simulation architectures without any simulation recoding or interruption. This makes it ready for implementation in standard parallel and distributed simulations.
The A*STAR-affiliated researchers contributing to this research are from the Institute of High Performance Computing
 Li, Z., Cai, W., Turner, S. J., Li, X., Duong, T. N. B., Goh, R. S. M. Adaptive resource provisioning mechanism in VEEs for improving performance of HLA-based simulations. ACM Transactions on Modeling and Computer Simulation 26, 1 (2015).
Original article from Agency for Science, Technology and Research
A*STAR Research | Research SEA
Ultra-precise chip-scale sensor detects unprecedentedly small changes at the nanoscale
18.01.2017 | The Hebrew University of Jerusalem
Data analysis optimizes cyber-physical systems in telecommunications and building automation
18.01.2017 | Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI
An important step towards a completely new experimental access to quantum physics has been made at University of Konstanz. The team of scientists headed by...
Yersiniae cause severe intestinal infections. Studies using Yersinia pseudotuberculosis as a model organism aim to elucidate the infection mechanisms of these...
Researchers from the University of Hamburg in Germany, in collaboration with colleagues from the University of Aarhus in Denmark, have synthesized a new superconducting material by growing a few layers of an antiferromagnetic transition-metal chalcogenide on a bismuth-based topological insulator, both being non-superconducting materials.
While superconductivity and magnetism are generally believed to be mutually exclusive, surprisingly, in this new material, superconducting correlations...
Laser-driving of semimetals allows creating novel quasiparticle states within condensed matter systems and switching between different states on ultrafast time scales
Studying properties of fundamental particles in condensed matter systems is a promising approach to quantum field theory. Quasiparticles offer the opportunity...
Among the general public, solar thermal energy is currently associated with dark blue, rectangular collectors on building roofs. Technologies are needed for aesthetically high quality architecture which offer the architect more room for manoeuvre when it comes to low- and plus-energy buildings. With the “ArKol” project, researchers at Fraunhofer ISE together with partners are currently developing two façade collectors for solar thermal energy generation, which permit a high degree of design flexibility: a strip collector for opaque façade sections and a solar thermal blind for transparent sections. The current state of the two developments will be presented at the BAU 2017 trade fair.
As part of the “ArKol – development of architecturally highly integrated façade collectors with heat pipes” project, Fraunhofer ISE together with its partners...
19.01.2017 | Event News
10.01.2017 | Event News
09.01.2017 | Event News
19.01.2017 | Earth Sciences
19.01.2017 | Life Sciences
19.01.2017 | Physics and Astronomy