In the last section, we discussed why prefetching and caching decisions should depend on the dynamic state of the system. Patterson [Patterson94] and Cao [Cao94] have argued that this issue should be addressed by separating access understanding from resource allocation. Specifically, Patterson proposed that applications issue informing hints that disclose their future accesses as a sequence, allowing the underlying system to make optimal global decisions about what and when to prefetch, and what to eject from memory to make space for prefetched data. By issuing informing hints, applications would be both portable to other machines and sensitive to the changing conditions on any given machine.
To validate his proposal, Patterson designed and built TIP, an informed prefetching and caching manager that replaces the Unified Buffer Cache manager in the Digital UNIX 3.2 kernel. TIP attempts to improve use of the file cache and I/O resources by performing a cost-benefit analysis. Roughly speaking, TIP estimates the benefit of prefetching in response to a hint based on the accuracy of the application's previous hints and on the immediacy of the hint. It balances this estimated benefit against an estimated cost of prefetching, composed of the estimated cost of ejecting a block from the cache and the estimated opportunity cost of using the I/O system. On a benchmark suite that included a range of applications, informed prefetching and caching reduced execution times by 12-72% when data files were striped over four disks (see Table 1). These results clearly demonstrate that application-level hints about future read accesses can guide prefetching and caching decisions that exploit the bandwidth provided by a parallel I/O system.
Benchmark | Execution time reduction | Description |
---|---|---|
Agrep | 72% | text search |
Gnuld | 66% | object code linker |
XDataSlice | 70% | scientific visualization |
Davidson | 12% | computational physics |
Postgres, 20% | 48% | relational database (20% of tuples in query result) |
Postgres, 80% | 69% | relational database (80% of tuples in query result) |
Sphinx | 21% | speech recognition |
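The cost-benefit comparison described above can be sketched roughly as follows. This is a deliberately simplified illustration: the structures, field names, and weighting (hint accuracy divided by distance, weighed against eviction plus I/O opportunity cost) are placeholders of our own, not TIP's actual estimators.

```c
#include <stdbool.h>
#include <stdio.h>

struct hint_stats {
    double   accuracy;       /* fraction of this application's past hints that were actually read */
    unsigned accesses_away;  /* how many accesses in the future the hinted block is expected */
};

struct cache_state {
    double eviction_cost;  /* estimated cost of ejecting the least valuable cached block */
    double io_busy_cost;   /* estimated opportunity cost of issuing another disk request now */
};

/* Prefetch only when the rough benefit estimate exceeds the rough cost estimate. */
static bool should_prefetch(const struct hint_stats *h, const struct cache_state *c)
{
    /* Benefit grows with hint accuracy and with immediacy (fewer accesses away). */
    double benefit = h->accuracy / (double)(h->accesses_away + 1);

    /* Cost combines losing a cached block and consuming I/O bandwidth. */
    double cost = c->eviction_cost + c->io_busy_cost;

    return benefit > cost;
}

int main(void)
{
    struct hint_stats  h = { .accuracy = 0.9, .accesses_away = 3 };
    struct cache_state c = { .eviction_cost = 0.10, .io_busy_cost = 0.05 };

    printf("prefetch hinted block: %s\n", should_prefetch(&h, &c) ? "yes" : "no");
    return 0;
}
```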
These results are impressive, but the applications had to be manually modified to issue hints. For some of the applications, such as Gnuld and Sphinx, this involved significantly restructuring the code so that hints could be issued earlier and more of the benefit of prefetching could be realized. The purpose of our research is to make the demonstrated benefits of prefetching readily accessible by automating the generation of informing hints.
Our design and implementation of speculative execution for automatic hint generation assumes that TIP is the underlying prefetching system, although it could be retargeted to other prefetching systems. As shown in Table 2, TIP's hint interface includes calls that are almost directly analogous to the basic UNIX read calls. Our only modification of TIP was the addition of the TIPIO_CANCEL_ALL call, which required only a few lines of code. This call cancels only outstanding hints; once a prefetch request has been issued, it cannot be cancelled.
Ioctl | Parameters | Description |
---|---|---|
TIPIO_SEG | batch of (filename, offset, length) | hints 1 or more segments from a named file |
TIPIO_FD_SEG | batch of (file descriptor, offset, length) | hints 1 or more segments from an open file |
TIPIO_CANCEL_ALL | none | cancels all outstanding hints from the issuing process |
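To illustrate how an application (or automatically generated hinting code) might use this interface, the sketch below discloses a batch of segments from an open file and later withdraws its hints. Only the ioctl names come from Table 2; the request codes, the batch layout, the choice to issue the ioctl on the hinted file's own descriptor, and the file name are hypothetical stand-ins for illustration.

```c
#include <fcntl.h>
#include <stdio.h>
#include <sys/ioctl.h>
#include <sys/types.h>
#include <unistd.h>

/* Placeholder request codes; a real caller would use TIP's own definitions. */
#ifndef TIPIO_FD_SEG
#define TIPIO_FD_SEG     0x54490001
#define TIPIO_CANCEL_ALL 0x54490002
#endif

/* Hypothetical batch layout for TIPIO_FD_SEG: a count plus (fd, offset, length) tuples. */
struct tip_seg {
    int   fd;      /* open file the segment belongs to */
    off_t offset;  /* starting byte offset of the hinted segment */
    off_t length;  /* length of the hinted segment in bytes */
};

struct tip_seg_batch {
    int            nsegs;
    struct tip_seg segs[2];
};

int main(void)
{
    int fd = open("data.input", O_RDONLY);  /* illustrative file name */
    if (fd < 0) {
        perror("open");
        return 1;
    }

    /* Disclose two segments that the program expects to read soon. */
    struct tip_seg_batch batch = {
        .nsegs = 2,
        .segs  = {
            { fd, 0,       64 * 1024 },
            { fd, 1 << 20, 64 * 1024 },
        },
    };

    if (ioctl(fd, TIPIO_FD_SEG, &batch) < 0)
        perror("ioctl(TIPIO_FD_SEG)");

    /* ... reads of the hinted segments would happen here ... */

    /* Withdraw any hints that have not yet become prefetch requests,
     * e.g. when a speculative code path turns out to be wrong. */
    if (ioctl(fd, TIPIO_CANCEL_ALL, 0) < 0)
        perror("ioctl(TIPIO_CANCEL_ALL)");

    close(fd);
    return 0;
}
```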