Data prefetching in multiprocessor vector cache memories

JWC Fu, JH Patel - ACM SIGARCH Computer Architecture News, 1991 - dl.acm.org
… technique we show how a non-prefetching vector cache can result in unpredictable … by
fetching a small data size for vector misses with long strides and the stride-prefetch

Vector prefetching

MK Gschwind, TJ Pietsch - ACM SIGARCH Computer Architecture News, 1995 - dl.acm.org
Prefetching Basics To measure the benefit which can be derived by using intelligent prefetching,
we decided to add a prefetching … applied to speed up list and vector processing [GP93]. …

Exploiting Vector Code Semantics for Efficient Data Cache Prefetching

F Martínez Palau, M Torrents, A Armejach… - Proceedings of the 38th …, 2024 - dl.acm.org
… to run in a vector processor regardless of the size of its vector registers. In … vector length
that cannot be changed. This paper demonstrates that VLA vector ISAs bring new prefetching

Stride directed prefetching in scalar processors

JWC Fu, JH Patel, BL Janssens - ACM SIGMICRO Newsletter, 1992 - dl.acm.org
… Stride directed prefetching has been proposed to improve the cache performance of numerical
programs executing on a vector processor. This paper shows how this approach can be …

Data prefetch mechanisms

SP Vanderwiel, DJ Lilja - ACM Computing Surveys (CSUR), 2000 - dl.acm.org
… In the case of prefetch-on-miss, prefetching is suppressed … Although vector prefetching does
issue prefetches for large … strides benefit the most from vector prefetching, as expected. For …

When prefetching works, when it doesn't, and why

J Lee, H Kim, R Vuduc - ACM Transactions on Architecture and Code …, 2012 - dl.acm.org
prefetching schemes we consider use prefetch intrinsics on top of gcc or icc compiler-inserted
prefetching. … We define prefetch distance as the distance ahead of which a prefetch should …

A hardware prefetching mechanism for vector gather instructions

H Takayashiki, M Sato, K Komatsu… - 2019 IEEE/ACM 9th …, 2019 - ieeexplore.ieee.org
… the memory access latencies of vector gather instructions. … a single vector gather instruction
by varying the prefetching … a prefetching mechanism for the vector gather instruction. …

A primer on hardware prefetching

B Falsafi, TF Wenisch - 2022 - books.google.com
… A key innovation of the design is a compressed representation of the instruction sequence
that uses bit vectors to efficiently encode spatial locality among prefetch addresses. Follow-on …

Compiler-generated vector-based prefetching on architectures with distributed memory

MM Müller - High Performance Computing in Science and …, 2001 - Springer
… In principle, vector commands for prefetching are only useful … prefetch instructions are used
throughout the whole prefetchvector strategy that declares the usage of vector operations. …

Effects of MSHR and prefetch mechanisms on an on-chip cache of the vector architecture

A Musa, Y Sato, T Soga, R Egawa… - … on Parallel and …, 2008 - ieeexplore.ieee.org
… (MSHR) and a prefetch mechanism. This paper evaluates the performance of the vector
cache with the MSHR and the prefetch mechanism on the vector supercomputer across three …