Predictive Approach
`
`James Griffioen, Randy Appleton
`Department of Computer Science
`University of Kentucky
`Lexington, KY 40506
`
`Abstract
`
`Despite impressive advances in file system throughput
`resulting from technologies such as high-bandwidth
`networks and disk arrays, file system latency has not
`improved and in many cases has become worse. Con-
`sequently, file system I/O remains one of the major
`bottlenecks to operating system performance [10].
`This paper investigates an automated predictive
`approach towards reducing file latency. Automatic
`Prefetching uses past file accesses to predict future
`file system requests. The objective is to provide data in
`advance of the request for the data, effectively masking
access latencies. We have designed and implemented a system to measure the performance benefits of automatic prefetching. Our current results, obtained from a trace-driven simulation, show that prefetching results in as much as a 280% improvement over LRU, especially for smaller caches. Alternatively, prefetching
`can reduce cache size by up to 50%.
`
`1 Motivation
`
`Rapid improvements in processor and memory speeds
`have created a situation in which I/O, in particular file
`system I/O, has become the major bottleneck to operat-
`ing system performance [10]. Recent advances in high
`bandwidth devices (e.g., RAID, ATM networks) have
`had a large impact on file system throughput. Unfor-
`tunately, access latency still remains a problem and is
`not likely to improve significantly due to the physical
`limitations of storage devices and network transfer la-
tencies. Moreover, the increasing popularity of certain file system designs and environments such as RAID, CDROM, wide area distributed file systems, wireless networks, and mobile hosts has only exacerbated the latency problem.
`For example, distributed file systems experience net-
`work latency combined with standard disk latency. As
`
`
`
`2.1 Caching
`
`Caching has been used successfully in many systems
`to substantially reduce the amount of file system I/O
`[16, 6, 8, 1]. Despite the success of caching, it is pre-
`cisely the accesses that cannot be satisfied from the
`cache that are the current bottleneck to file system per-
`formance [10]. Unfortunately, increasing the cache
`size beyond a certain point only results in minor per-
`formance improvements. Experience shows that the
`relative benefit of caching decreases as cache size (and
`thus cache cost) increases [9, 8]. There exists a thresh-
`old beyond which performance improvements are mi-
`nor and prohibitively expensive. Moreover, studies
show that the “natural” cache size or threshold is becoming a substantially larger fraction (one fourth to one third) of the total memory, due in part to larger files
`(e.g., big applications, databases, video, audio, etc.)
`[2]. Consequently, new methods are needed to reduce
`the perceived latency of file accesses and keep cache
`sizes in check.
Although machines with large memories are now available, low-end workstations, PCs, mobile laptops/notebooks, and now PDAs (personal digital assistants) with limited memory capacities enjoy widespread use. Because of cost or space constraints, these machines cannot support large file caches. The desire for smaller portable machines, combined with continually increasing file sizes, means that large caches cannot be assumed to be the complete solution to the latency problem.
`Finally, as a result of rapid improvements in band-
`width, cache miss service times are dominated by la-
`tency. Note that:
`
• Most files are quite small. In fact, measurements of existing distributed file systems show that the average file is only a few kilobytes long [9, 2]. For files of this size, transmission rate is of little concern when compared to the access latency across a WAN or from a slow device. As a result, access latency, not bandwidth, becomes the dominant cost for references to files not in the cache.
`
• In many distributed file systems, the open() and
`close() functions represent synchronization points
`for shared files. Although the file itself may reside
`in the client cache, each open() and close() call
`must be executed at the server for consistency
`reasons. The latency of these calls can be quite
`large, and tends to dominate other costs, even
`when the file is in the file cache.
`
`In short, the benefits of standard caching have been
realized. To improve file system performance further and keep file cache sizes in check, caching will need to
`be supplemented with new methods and algorithms.
`
`2.2 Prefetching
`
`The concept of prefetching has been used in a va-
`riety of environments including microprocessor de-
`signs, virtual memory paging, databases, and file read
`ahead. More recently, long term prefetching has been
`used in file systems to support disconnected operation
`[14, 15, 5]. Prefetching has also been used to improve
`parallel file access on MIMD architectures [4].
One relatively straightforward method of prefetch-
`ing is to have each application inform the operating
`system of its future requirements. This approach has
been proposed by Patterson et al. [11]. Using this ap-
`proach, the application program informs the operating
`system of its future file requirements, and the operating
`system then attempts to optimize those accesses. The
`basic idea is that the application knows what files will
`be needed and when they will be needed.
`Application directed prefetching is certainly a step
`in the right direction. However, there are several draw-
`backs to this approach. Using this approach, applica-
`tions must be rewritten to inform the operating system
`of future file requirements. Moreover, the program-
`mer must learn a reasonably complex set of additional
`system directives that must be strategically deployed
`throughout the program. This implies that the appli-
`cation writer must have a thorough understanding of
`the application and its file access patterns. Ironically, a
`key goal of many recent languages, in particular object-
oriented languages, is abstraction and encapsulation:
`hiding the implementation details from the program-
`mer. Even when the details are visible, our experience
indicates that the sheer size and complexity of many
`software systems creates a situation in which experts
`may have difficulty grasping the complete picture of
`file access patterns. Moreover, incorrectly placed di-
`rectives or an incomplete set of directives can actually
`degrade performance rather than improve it.
`A second problem is that the operating system needs
a significant lead-time to ensure the file is available
`when needed. Therefore, in order to benefit from
`prefetching, the application must have a significant
`amount of computation to do between the time the file
`is predicted and the time the file is accessed. However,
`many applications do not know which files they will
`need until the actual need arises. For instance, the pre-
`processor of a compiler does not know the pattern of
`nested include files until the files are actually encoun-
`tered in the input stream, nor will an editor necessarily
`know which files a user normally edits. Our approach
attempts to solve this problem by predicting the need for a file well in advance of when the application could;
`in some cases long before the application even begins
`to execute.
`A third problem with application driven prefetching
`arises in situations where related file accesses span mul-
`tiple executables. Typically applications are written in-
`dependently and only know file access patterns within
`the application. In situations where a series of applica-
`tions execute repeatedly, like an edit/compile/run cycle,
`or certain commonly run shell scripts, no one applica-
`tion knows the cross-application file access patterns,
`and therefore cannot inform the operating system of a
`future application’s file requirements. In some cases,
`batch-type utilities, such as the Unix make facility, can
`be instrumented to understand cross-application access
`patterns. However, even in this case, a complete view
of the real cross-application pattern is often unknown to
`the user or requires extreme expertise to determine the
`pattern. Our approach uses long term history informa-
`tion to support prefetching across application bound-
`aries.
`
`3 Automatic Prefetching
`
`We are investigating an approach we call automatic
prefetching, in which the operating system rather
`than the application predicts future file requirements.
`The basic idea and hypothesis underlying automatic
`prefetching is that future file activity can be success-
`fully predicted from past file activity. This knowledge
`can then be used to improve overall file system perfor-
`mance.
`Automatic prefetching has several advantages over
`existing approaches. First, existing applications do not
`need to be rewritten or modified, nor do new appli-
`cations need to incorporate non-portable prefetching
`operations. As a result, all applications receive the
`benefits of automatic prefetching, including existing
`software. Second, because the operating system au-
`tomatically performs prefetching on the application’s
`behalf, application writers can concentrate on solving
`the problem at hand rather than worrying about opti-
`mizing file system performance. Third, the operating
`system monitors file access across application bound-
`aries and can thus detect access patterns that span mul-
`tiple applications executed repeatedly. Consequently,
the operating system can prefetch files substantially earlier than they are actually needed, often before the
`application even begins to execute.
Automatic prefetching allows the operating system to effectively overlap processing with file transfers.
`The operating system can also use past access infor-
`mation to batch together multiple file requests and thus
`make better use of available bandwidth. Past access in-
`
`formation can also be used to improve the cache man-
`agement algorithm, effectively reducing cache misses
`even if no prefetching occurs.
`The first goal of our research was to determine
`whether such an approach is viable. Our second goal
`was to develop effective prefetch policies and quantify
`the benefits of automatic prefetching. The following
`sections consider each of these objectives and describe
`our results.
`
`4 Analysis of Existing Systems
`
`To determine the viability of automatic prefetching, we
`analyzed current file system usage patterns. Although
`other researchers have gathered file system traces [9, 2],
`we decided to modify the SunOS kernel in order to
`gather our own traces that extract specific information
`important to our research. In addition to recording all
`file system calls made by the system, the kernel gathers
`precise information regarding the issuing process and
`the timing for every operation. The timing information
`not only serves as an indicator of the system’s perfor-
`mance, but it also provides information as to whether
`prefetching can have any substantial effects on perfor-
`mance.
`We gathered a variety of traces, including the normal
`daily usage of several researchers, and also various
`synthetic workloads. Traces were collected on a single
`Sun Sparcstation supporting several users executing a
`variety of tasks. Traces were collected for varying time
`periods with the longest traces spanning more than 10
`days and containing over 500,000 operations. Users
`were not restricted in any way. Typical daily usage
`included users processing email, editing, compiling,
preparing documents, and executing other tasks typical
`of an academic environment. This particular set of
`traces contains almost no database activity. The data
`we collected appears to be in line with that of other
`studies [9, 2] given similar workloads.
`Our initial analysis of the trace data indicates that
`typical file system usage can realize substantial per-
`formance improvements from the use of prefetching,
`and also provides several guidelines for a successful
`prefetching policy.
`First, the data shows that there is relatively little time
`between the moment when a file is opened and the
`moment when the first read occurs (see figure 1). In
`fact, the median time for our traces was less than three
`milliseconds. Consequently, prefetching must occur
`significantly earlier than the open operation to achieve
`any significant performance improvement. Prefetching
`at open time will only provide minor improvements.
[Figure 1: Histogram of times between open and first read of a file. X axis: time in ms; Y axis: percent of all opens.]

Second, the data shows that the average amount of time between successive opens is substantial (200 ms). If the operating system can accurately predict the next file that will be accessed, there exists a sufficient amount of time to prefetch the file.
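Both measurements are straightforward to reproduce from a trace. The following sketch (in Python) shows one way to compute them; the trace record format, a time-ordered sequence of (operation, time in ms, path) tuples, is an assumption made for illustration and is not the format produced by our tracing kernel.

    def gap_statistics(trace):
        """Compute open-to-first-read gaps and open-to-open gaps (both in ms).
        `trace` is assumed to be a time-ordered list of (op, time_ms, path)."""
        open_time = {}     # path -> time of its most recent, not-yet-read open
        read_gaps = []     # time from each open to the first read of that file
        open_gaps = []     # time between successive opens, across all files
        last_open = None
        for op, time_ms, path in trace:
            if op == "open":
                if last_open is not None:
                    open_gaps.append(time_ms - last_open)
                last_open = time_ms
                open_time[path] = time_ms
            elif op == "read" and path in open_time:
                read_gaps.append(time_ms - open_time.pop(path))
        return read_gaps, open_gaps

    # For example, [("open", 0, "a"), ("read", 2, "a"), ("open", 210, "b")]
    # yields read_gaps == [2] and open_gaps == [210].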
`In a multi-user, multiprogramming environment,
`concurrently executing tasks may generate an inter-
`leaved stream of file requests. In such an environment,
`reliable access patterns may be difficult to obtain. Even
when patterns are discernible, the randomness of the
`concurrency may render the prefetching effort inef-
`fective. However, analysis of trace data consisting of
`multiple users (and various daemons) shows that even
in a multiprogramming environment accesses tend to be “sequential”, where we define sequential as a sensible, predictable, uninterrupted progression of file accesses associated with a task. In fact, measurements
`show that over 94% of the accesses follow logically
`from the previous access. Thus multiprogramming
`seems to have little effect on the ability to predict the
`next file referenced.
`
`5 The Probability Graph
`
`We have designed and implemented a simple analyzer
`that attempts to predict future accesses based on past
`access patterns. Driven by trace data, the analyzer
`dynamically creates a logical graph called a Probability
`Graph. Each node in the graph represents a file in the
`file system.
Before describing the probability graph, we must define the lookahead period used to construct the graph.
`The lookahead period defines what it means for one file
`to be opened “soon” after another file. The analyzer
`defines the lookahead period to be a fixed number of
`file open operations that occur after the current open.
`If a file is opened during this period, the open is consid-
`ered to have occurred “soon” after the current open. A
`physical time measure rather than a virtual time mea-
`sure could be used, but the above measure is easily
`obtained and can be argued to be a better definition
`of “soon” given the unknown execution times and file
`access patterns of applications. Our results show that
`this measure works well in practice.
`We say two files are related if the files are opened
`within a lookahead period of one another. For example,
`if the lookahead period is one, then the next file opened
`is the only file considered to be related to the current
`file. If the lookahead period is five, then any file opened
`within five files of the current file is considered to be
`related to the current file.
`The analyzer allocates a node in the probability
`graph for each file of interest in the file system. Unix
`exec system calls are treated like opens and thus are
`included in the probability graph. One graph, derived
`from the trace described in section 7, generated ap-
`proximately 6,500 nodes accessed over an eight day
`period. Each node consumes less than one hundred
`bytes, and can be efficiently stored on disk in the inode
of each associated file, with active portions cached for better performance. Our current graph storage scheme
`has not been optimized and thus is rather wasteful. We
`have recently begun investigating methods that will
`substantially reduce the graph size via graph pruning,
`aging, and/or compression.
Arcs in the probability graph represent related accesses. If the open for one file is followed, within the lookahead period, by the open for a second file, a directed arc is drawn from the first file to the second. Larger lookaheads produce more arcs. The analyzer weights each arc by the number of times that the second file is accessed after the first file. Thus, the graph summarizes the ordered stream of files demanded from the file system, and each arc represents the likelihood of a particular file being opened soon after another file.
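A condensed sketch of this construction is shown below (in Python). The class and field names are ours, invented for illustration; the analyzer itself is driven directly by kernel trace records.

    from collections import defaultdict, deque

    class ProbabilityGraph:
        """Nodes are files; a weighted, directed arc a -> b counts how often
        file b was opened within `lookahead` opens of file a."""

        def __init__(self, lookahead=1):
            self.lookahead = lookahead
            self.arcs = defaultdict(lambda: defaultdict(int))  # arcs[a][b] = weight
            self.recent = deque(maxlen=lookahead)               # last `lookahead` opens

        def record_open(self, path):
            """Process one open (or exec): every file still in the lookahead
            window gains (or strengthens) an arc to the newly opened file."""
            for earlier in self.recent:
                if earlier != path:
                    self.arcs[earlier][path] += 1
            self.recent.append(path)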
Figure 2 illustrates the structure of an example probability graph.

[Figure 2: Three nodes of an example probability graph (config, alloca.h, and tm.h) connected by weighted arcs representing related accesses.]

The probability graph provides the information necessary to make intelligent prefetch decisions. We define the chance of a prediction being
`correct as the probability of a file (say file B) being
`opened given the fact that another file (file A) has been
opened. The chance of file B following file A can be obtained from the probability graph as the weight of the arc from file A to file B divided by the total weight of all arcs leaving file A. We say a prediction is
`reasonable if the estimated chance of the prediction is
`above a tunable parameter minimum chance. We say
`a prediction is correct if the file predicted is actually
`opened within the lookahead period.
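In terms of the sketch above, a reasonable prediction can be computed as follows; the function name, the minimum-chance default, and the example arc weights are illustrative assumptions only.

    def predict(arcs, current, minimum_chance=0.5):
        """Return (file, chance) pairs for files whose estimated chance of
        following `current` meets the minimum-chance threshold."""
        successors = arcs.get(current, {})
        total = sum(successors.values())
        if total == 0:
            return []
        return [(path, weight / total)
                for path, weight in successors.items()
                if weight / total >= minimum_chance]

    # With hypothetical weights arcs = {"config": {"alloca.h": 97, "tm.h": 30}},
    # predict(arcs, "config") returns [("alloca.h", 0.76...)]; tm.h falls below
    # the 50% minimum chance and is not predicted.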
`Establishing a minimum chance requirement is cru-
`cial to avoid wasting system resources. In the absence
`of a minimum requirement, the analyzer would produce
`several predictions for each file open, consuming net-
`work and cache resources with each prediction, many
`of which would be incorrect.
`
`To measure the success of the analyzer we define an
`accuracy value. The accuracy of a set of predictions is
`the number of correct predictions divided by the total
`number of predictions made. The accuracy will almost
`always be at least as large as the minimum chance, and
`in practice is substantially higher.
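Stated as code, the accuracy metric might be computed over a replayed stream of opens as in the following sketch; again, the data layout is an assumption made for illustration.

    def accuracy(predictions, opens, lookahead):
        """`opens` is the ordered list of opened files; `predictions` is a list
        of (i, path) pairs, meaning `path` was predicted at open number i.
        A prediction is correct if `path` is opened within the next
        `lookahead` opens."""
        if not predictions:
            return 0.0
        correct = sum(1 for i, path in predictions
                      if path in opens[i + 1 : i + 1 + lookahead])
        return correct / len(predictions)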
`The number of predictions made per open call varies
`with the required accuracy of the predictions. Re-
`quiring very accurate predictions (predictions that are
`almost never wrong) means that only a limited number
`of predictions can be made. For one set of trace data,
using a relatively low minimum chance value (65%), the predictor averaged 0.45 files predicted per open. For a higher minimum chance value (95%), the predictor av-
`eraged only 0.1 files predicted per open. Even when
`using a relatively low minimum chance (e.g., 65%), the
`predictor was able to make a prediction about 40% of
`the time and was correct on approximately 80% of the
`predictions made.
`Figure 3 shows the distribution of estimated chance
values with a lookahead of one. The distribution shows
`that a large number of predictions have an estimated
`chance of 100%. Setting the minimum chance less
`than 50% places the system in danger of prefetching
`many unlikely files. By setting the minimum chance at
`50%, very few files that should have been prefetched
`will be missed. Moreover, the distribution shows how
`a low minimum chance can still result in a high average
`accuracy.
`
`6 A Simulation System
`
`To evaluate the performance of systems based on au-
`tomatic prefetching, we implemented a simulator that
models a file system. In order to simulate a variety
`of file system architectures having a variety of perfor-
`mance characteristics, the simulator is highly parame-
`terized and can be adjusted to model several file system
`designs. This flexibility allows us to measure and com-
`pare the performance of various cache management
`policies and mechanisms under a wide variety of file
`system conditions. The simulator consists of four basic
`components: a driver, cache manager, disk subsystem,
`and predictor.
`The driver reads a timestamped file system trace and
`translates each file access into a file system request for
`the simulator to process. Because the driver generates
`file requests directly from the trace data, the workload
`is exactly like that of typical (concurrent) user-level
`applications. However, the driver must modify the
`set of requests in a few special cases. Because the
`simulator is only interested in file system I/O activity,
`the driver removes accesses made to files representing
devices such as terminals or /dev/null. References to certain standard shared libraries such as the C library are also eliminated. Accesses (e.g., mmap() calls) to these libraries rarely require any file system activity, since they are typically already present in the virtual memory cache.

[Figure 3: Histogram of estimated chances given a lookahead of one. X axis: estimated chance in percent; Y axis: percent of all arcs.]
`The cache manager manages a simulated file cache
`and services as many requests as possible from the
`cache without invoking the disk subsystem. We have
`implemented two cache managers. The first is a stan-
`dard LRU cache manager, where disk pages are re-
`placed in the order of least recent use. The second
`cache manager is the prefetch cache manager. The
`prefetch cache manager operates much like the LRU
`manager, updating timestamps on each access and re-
`placing the least recently used page. However, the
`prefetch manager also updates timestamps based on
`knowledge of expected accesses from the predictor,
thus rescuing some soon-to-be-accessed pages from
`replacement. We have found that prefetch cache man-
`agement can improve performance even if no prefetch-
`ing occurs (i.e., no pages are actually brought in ahead
`of time). When run in prefetch mode, the simulator
`shows that anywhere between 5% and 30% of the per-
`formance improvement comes from pages that were
`rescued rather than actually being prefetched.
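The following sketch contrasts the two cache managers. It works at whole-file granularity and uses invented names purely to illustrate the rescue mechanism; the simulator itself manages individual disk pages.

    import itertools

    class LRUCache:
        """Fixed number of slots, least-recently-used replacement."""

        def __init__(self, slots):
            self.slots = slots
            self.stamp = {}                # path -> timestamp of last use
            self.clock = itertools.count()

        def access(self, path):
            """Return True on a hit; on a miss, evict the LRU entry and insert."""
            hit = path in self.stamp
            if not hit and len(self.stamp) >= self.slots:
                victim = min(self.stamp, key=self.stamp.get)
                del self.stamp[victim]
            self.stamp[path] = next(self.clock)
            return hit

    class PrefetchCache(LRUCache):
        """Identical replacement rule, but data the predictor expects to be
        needed soon has its timestamp refreshed, rescuing it from replacement."""

        def rescue(self, path):
            if path in self.stamp:
                self.stamp[path] = next(self.clock)  # now looks recently used
                return True                          # already cached; no I/O needed
            return False                             # caller may prefetch from disk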
`The task of the disk subsystem is to simulate a file
`storage device. The current disk subsystem has been
`configured to emulate local disks. Local disk have rel-
`atively low latency when compared to our other target
`
`file systems (e.g., wide area distributed file systems,
`CDROMs, RAIDs, or wireless networks). Conse-
`quently, we expect that the performance improvements
`realized with a local disk model will only be amplified
`in our other target environments. In the following tests,
`we assumed a disk model with a first access latency of
`15 ms and a transfer rate of 2 MB/sec after factoring in
`typical file system overhead.
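Under these assumptions, the simulated cost of a miss reduces to a fixed latency plus a transfer term, as in the sketch below (the constant and function names are ours):

    DISK_LATENCY_S = 0.015                  # 15 ms first-access latency
    DISK_BANDWIDTH_BPS = 2 * 1024 * 1024    # 2 MB/sec, file system overhead included

    def disk_service_time(nbytes):
        """Simulated time (in seconds) to fetch `nbytes` from the modeled disk."""
        return DISK_LATENCY_S + nbytes / DISK_BANDWIDTH_BPS

    # For example, an 8 KB read costs roughly 15 ms + 4 ms = 19 ms under this model.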
`Finally, the simulator contains a predictor. The
`predictor observes open requests that arrive from the
`driver, and records the data in the probability graph
`described earlier. The predictor builds the probability
`graph dynamically just as it would be done in a real
`system. The longer the simulator executes, the wiser it
`becomes. On each access the simulator gains a clearer
`understanding of the true access patterns.
`During each open, the probability graph is examined
`for prefetch opportunities. If an opportunity is discov-
`ered, then a read request is sent to the cache manager. If
`the cache contains the appropriate data, then the data’s
`access time is set to the current time. This ensures
`that the data will be present for the anticipated need,
`and possibly rescues the data from an impending flush
`from the cache. If the prefetch request cannot be satis-
`fied from the cache, then it is prefetched from the disk
`subject to the characteristics of the disk subsystem.
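Combining the earlier sketches, the open-time path just described might be structured as follows; this is our own simplification of the flow, not the simulator's actual code.

    def handle_open(path, graph, cache, disk_queue, minimum_chance=0.5):
        """On each open: update the probability graph, then issue a prefetch
        request for every reasonable prediction.  Data already in the cache is
        rescued (its access time refreshed); anything else is queued for the
        simulated disk."""
        graph.record_open(path)
        for predicted, chance in predict(graph.arcs, path, minimum_chance):
            if not cache.rescue(predicted):
                disk_queue.append(predicted)   # fetched subject to the disk model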
`Notice that the current disk subsystem does no re-
`ordering of requests. In particular, it does not preempt
or defer prefetch requests to satisfy subsequent application requests. Reordering and prioritizing requests represents an area of further potential performance improvements.

[Figure 4: Histogram of times between prefetch and first read access. X axis: time in ms; Y axis: percent of all prefetches.]
`We are currently in the process of implementing the
`automatic prefetching system inside a Unix kernel run-
`ning NFS to measure performance on an actual system.
`
`7 Experimental Results
`
`We performed several tests to measure the performance
`improvements achieved by automatic prefetching. For
`the particular set of tests described below, a trace taken
`over an eight day period containing the unrestricted
`activity of multiple users was used. To determine the
`performance benefits of prefetching, we ran several
`simulations varying the cache size, lookahead value,
`and minimum chance and also measured the LRU per-
`formance in each case for comparison purposes.
Recall from section 4 that the time between the open
`of a file and the first read is too small for prefetching to
`be effective. Figure 4 shows that the simulator is able
`to predict and begin prefetching files sufficiently far in
`advance of the first read to the file. Our measurements
`indicate that 94% of the files that were predicted and
then subsequently accessed were prefetched more than
`20 ms before the actual need, resulting in cache hits at
`the time of the first read.
`
7.1 Prefetch Parameters’ Effect on Performance
`
`Two parameters that significantly affect the predictions
`made by the predictor are the lookahead and minimum
`chance values.
`Recall that the lookahead represents how close two
file opens need to be for the files to be considered related.
`Setting this value very large increases the number of
`files that are considered related to each other, and there-
`fore each file open may potentially cause several other
`files to be prefetched.
`Large lookaheads increase the number of files
`prefetched since more predictions are made in response
`to each open request. Moreover, large lookaheads re-
`sult in files being prefetched substantially earlier, be-
`cause predictions can be made much further in ad-
`vance. As a result, large lookaheads are inappropriate
`for smaller cache sizes, but often perform very well
with larger caches. In the case of small caches, large
`lookaheads tend to prefetch files too far in advance of
`the need. As a result, data necessary to the current com-
`putation may be forced out of the cache and replaced
`
`
`
[Figure 5: Cache misses as a function of lookahead and MinChance for a 400K cache. Performance varies by as much as 13% (between 43% and 56%) depending on the lookahead and minchance settings.]

[Figure 6: Cache misses as a function of lookahead and MinChance for a 4M cache. Performance varies by as much as 2% (between 9% and 11%) depending on the lookahead and minchance settings.]
`
`
`
`
`
`
[Figure 7: Cache misses as a function of cache size, comparing prefetching and LRU. X axis: cache size in KB; Y axis: miss rate in percent.]
`
`than LRU using a cache half the size. This is partic-
`ularly important for machines that do not have large
`amounts of memory available for file caching. Even
`for large memory machines, the ability to achieve sim-
`ilar performance using smaller cache sizes results in
`more memory for applications. This also indicates that
`the number of correctly prefetched pages more than
`offsets any pages incorrectly forced out of the cache by
`prefetching, even for small cache sizes.
`For this particular trace, both LRU and prefetching
`realize relatively little improvement in the miss ratios
for caches larger than 4 MB. However, although LRU
`performance begins to approach prefetch performance
`as cache size increases, simulations out to cache sizes
`of 20 MB still show that prefetching results in an 11%
`reduction in the number of misses as compared to LRU.
`
`8 Conclusions
`
`Our results show that reasonable predictions can be
`made based on past file activity. As a result, auto-
`matic prefetching can substantially reduce I/O latency,
`make better use of the available bandwidth via batched
`prefetch requests, and improve cache utilization. As
`wide area distributed file systems, CDROM, RAID,
`
`
`
9 References

[3] James Griffioen and Randy Appleton. Automatic Prefetching in a WAN. In Proceedings of the IEEE Workshop on Advances in Parallel and Distributed Systems, pages 8–12, October 1993.

[4] D. Kotz and C. Ellis. Prefetching in file systems for MIMD multiprocessors. IEEE Transactions on Parallel and Distributed Systems, 1:218–230, 1990.

[5] Geoff Kuenning, Gerald J. Popek, and Peter Reiher. An Analysis of Trace Data for Predictive File Caching in Mobile Computing. In Proceedings of the 1994 Summer USENIX Conference, June 1994.

[6] Samuel J. Leffler, Marshall K. McKusick, Michael J. Karels, and John S. Quarterman. The Design and Implementation of the 4.3 BSD Unix Operating System. Addison Wesley, 1989.

[7] J. Morris, M. Satyanarayanan, M. Conner, J. Howard, D. Rosenthal, and F. Smith. Andrew: A Distributed Personal Computing Environment. CACM, 29:184–201, March 1986.

[8] M. Nelson, B. Welch, and J. Ousterhout. Caching in the Sprite network file system. ACM Transactions on Computer Systems, 6(1):134–154, February 1988.

[9] J. Ousterhout, H. Da Costa, D. Harrison, J. Kunze, M. Kupfer, and J. Thompson. A Trace-Driven Analysis of the Unix 4.2 BSD File System. In Proceedings of the 10th Symposium on Operating Systems Principles, pages 15–24, December 1985.

[10] John K. Ousterhout. Why Aren’t Operating Systems Getting Faster As Fast as Hardware? In Proceedings of the Summer 1990 USENIX Conference, pages 247–256, June 1990.

[11] H. Patterson, G. Gibson, and M. Satyanarayanan. A Status Report on Research in Transparent Informed Prefetching. SIGOPS Operating Systems Review, 27(2):21–34, April 1993.

[12] D. Presotto, R. Pike, K. Thompson, and H. Trickey. Plan 9, A Distributed System. In Proceedings of the Spring 1991 EurOpen Conference, pages 43–50, May 1991.

[13] R. Sandberg, D. Goldberg, S. Kleiman, Dan Walsh, and Bob Lyon. Design and Implementation of the Sun Network File System. In Proceedings of the Summer USENIX Conference, pages 119–130. USENIX Association, June 1985.

[14] M. Satyanarayanan. Coda: A Highly Available File System for a Distributed Workstation Environment. IEEE Transactions on Computers, 39:447–459, April 1990.

[15] Peter Skopp and Gail Kaiser. Disconnected Operation in a Multi-User Software Development Environment. In Proceedings of the IEEE Workshop on Advances in Parallel and Distributed Systems, pages 146–151, October 1993.

[16] A. Smith. Cache memories. Computing Surveys, 14(3), September 1982.

[17] R. van Renesse, A. S. Tanenbaum, and A. Wilschut. The Design of a High Performance File Server. In Proceedings of the IEEE 9th International Conference on Distributed Computing Systems, 1989.

10 Author Information

James Griffioen is an Assistant Professor in the Computer Science Department at the University of Kentucky. He received a B.A. in computer science from Calvin College in 1985, and his M.S. and Ph.D. in computer science from Purdue University in 1988 and 1991, respectively. He was the recipient of the ’89-’90 USENIX scholarship. His research interests include high-performance distributed file systems, scalable distributed shared memory systems, and high-speed network protocols. His email address is griff@dcs.uky.edu.

Randy Appleton is a Ph.D. student in the Computer Science Department at the University of Kentucky. He received his B.S. degree from the University of Illinois in 1989 and his M.S. from the University of Kentucky in 1992. His research interests are distributed file systems, operating systems, and databases. His email address is randy@dcs.uky.edu.
`
`
`