【Affiliation】: Department of Computer Science, Computing Division
【Abstract】:
Data access delay is a major bottleneck in utilizing current high-end computing (HEC) machines. Prefetching, where data is fetched before the CPU demands it, has been considered an effective solution for masking data access delay. However, current client-initiated prefetching strategies, where a computing processor issues the prefetching instructions, have many limitations. In particular, they do not work well for applications with complex, non-contiguous data access patterns. As technology advances continue to widen the gap between computing and data access performance, trading computing power for reduced data access delay has become a natural choice. In this paper, we present a server-based data-push approach and discuss its associated implementation mechanisms. In the server-push architecture, a dedicated server called the Data Push Server (DPS) initiates prefetching and proactively pushes data closer to the client in time. Issues such as what data to fetch, when to fetch it, and how to push it are studied. To test DPS-based prefetching, the SimpleScalar simulator is modified with a dedicated prefetching engine that pushes data for another processor. Simulation results show that the L1 cache miss rate can be reduced by up to 97% (71% on average) over a superscalar processor for SPEC CPU2000 benchmarks that have high cache miss rates.
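The push idea can be illustrated with a toy model. The sketch below is not the paper's SimpleScalar setup and does not reproduce its results: it assumes a small fully associative LRU cache and a hypothetical `StridePushServer` that watches the client's access stream and pushes the next blocks once it has seen the same stride twice. The stride predictor, the class names, the cache size, and the push depth are all illustrative assumptions; the abstract does not say how the DPS decides what to fetch.

```python
from collections import OrderedDict


class LRUCache:
    """Toy fully associative LRU cache holding block addresses."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.blocks = OrderedDict()

    def fill(self, block):
        """Install a block (demand fill or push), evicting the LRU block if full."""
        if block in self.blocks:
            self.blocks.move_to_end(block)
            return
        if len(self.blocks) >= self.capacity:
            self.blocks.popitem(last=False)
        self.blocks[block] = True

    def access(self, block):
        """Demand access: return True on a hit; on a miss, fill and return False."""
        if block in self.blocks:
            self.blocks.move_to_end(block)
            return True
        self.fill(block)
        return False


class StridePushServer:
    """Hypothetical push server (assumption): predicts by the last observed stride."""

    def __init__(self, cache, depth=2):
        self.cache = cache
        self.depth = depth      # how many blocks ahead to push
        self.last = None        # last block seen
        self.stride = None      # last stride seen

    def observe(self, block):
        if self.last is not None:
            new_stride = block - self.last
            # Push only after the same non-zero stride is seen twice in a row.
            if new_stride != 0 and new_stride == self.stride:
                for k in range(1, self.depth + 1):
                    self.cache.fill(block + k * new_stride)
            self.stride = new_stride
        self.last = block


def miss_rate(trace, capacity=8, push=False):
    """Replay a trace of block addresses and return the demand miss rate."""
    cache = LRUCache(capacity)
    server = StridePushServer(cache) if push else None
    misses = 0
    for block in trace:
        if not cache.access(block):
            misses += 1
        if server is not None:
            server.observe(block)
    return misses / len(trace)


if __name__ == "__main__":
    trace = [3 * i for i in range(100)]  # synthetic constant-stride trace
    print("no push:", miss_rate(trace, push=False))
    print("push:   ", miss_rate(trace, push=True))
```

On this synthetic constant-stride trace, every access misses without the push server, while with it the client misses only during the first few accesses, before the stride has been learned. Real workloads with complex, non-contiguous patterns would need a far richer predictor than this one.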