(NOTE: All chapters conclude with a Summary.)

Preface.
Intended Audience.Organization of the Book.

Acknowledgments.

Introduction.
The Basics of Web Caching.The Basics of Web Replication.Beyond Performance.

I. BACKGROUND.

1. Network Layers and Protocols.
The ISO/OSI Reference Model.Network Components at Different Layers.Overview of Internet Protocols.2. The Internet Protocol and Routing.
Addressing.IP Datagram Header.Routing.Routing within ASs.Routing between ASs.Multicast.3. Transmission Control Protocol.
Segment Header.Opening a Connection.Closing a Connection.Flow Control.Congestion Control.Retransmission.4. Application Protocols for the Web.
Uniform Resource Locators.The Domain Name System.Name Hierarchy.The DNS Protocol.The HyperText Transfer Protocol.The HTTP Request.The HTTP Response.The HTTP Message Exchange.Hyperlinks and Embedded Objects.5. HTTP Support for Caching and Replication.
Conditional Requests.Conditional Headers Used for Caching.Conditional Headers Used for Replication.Age and Expiration of Cached Objects.Request Redirection.Range Requests.The Cache-Control Header.Cache-Control Header Directives in Requests.Cache-Control Header Directives in Responses.Example of the Cache-Control Header.Storing State for a Stateless Server: Cookies.Support for Server Sharing.Expanded Object Identifiers.Learning the Proxy Chain.Cacheability of Web Content.6. Web Behavior Rules of Thumb.
Evaluation Methods.Live Measurements.Trace-Based Methods.Benchmarking.Object Size.Object Types and Cacheability.Object Popularity.Locality of Reference.Temporal Locality.Spatial Locality.Rate of Object Modifications.Other Observations.

II. WEB CACHING.

7. Proxy Caching: Realistic Expectations.
Do Proxy Caches Deserve a Hearing?Latency Reduction.An Optimistic Bound on Latency Reduction.A Pessimistic View of Latency Reduction.TCP Connection Caching.Connection Caching versus Data Caching.TCP Connection Splitting.Environment-Specific TCP Optimizations.Bandwidth Savings.Proxies and Streaming Media.8. Proxy Deployment.
Overview of Internet Connectivity Architectures.Nontransparent Proxy Deployment.Explicit Client Configuration.Browser Autoconfiguration.Proxy Auto-Discovery.Transparent Proxy Deployment.Multipath Problem.Interception Mechanisms.Layer 4 Switch as an Intercepter.Router as an Intercepter.Layer 7 Switch as an Intercepter.Intercepting Link.Performance Pitfalls.Security and Access Control Issues.Proxies and Web Server Access Control.Proxies and Security.9. Cooperative Proxy Caching.
Shared Cache: How Big Is Big Enough?Issues in Cooperative Proxy Caching.Location Management.Broadcast Queries.Hierarchical Caching.URL Hashing.Directory-Based Cooperation.Directory Structures.Caching on a Global Scale: Proxy Pruning.System Model.Cache Routing.Vicinity Caching.An Overview of Existing Platforms.Cache Hierarchies.Caching as a Service of a Network Access Point.Satellite Broadcast Cache Service.10. Cache Consistency.
Cache Validation.The Basic Validation Scenario.Implicit Time to Live.Fine-Tuning Validation.Asynchronous and Piggyback Cache Validation.Cache Invalidation.Leases.Subscriptions.Delayed versus Immediate Updates.Volumes.Volume Lease Protocols.Piggyback and Delayed Invalidation.Invalidation in Cache Routing.Issues in Cooperative Cache Consistency.Validation with Cooperative Proxies.Non-Monotonic Delivery Problem.11. Replacement Policy.
Replacement Policy Metrics.Replacement Policy Algorithms.The Value of Replacement Policy.12. Prefetching.
Performance Metrics.Performance Bounds of Prefetching.Taxonomy.Nondata Prefetching.Nontransparent Prefetching.User Nontransparency.Server Nontransparency.Server Push versus Client Pull.Information Used in Prefetching Algorithms.User-Specific Information.Group Information.Multiuser Information.Prediction Algorithms.Popularity-Based Predictions.Markov Modeling.Examples of Algorithms using First-order Markov Modeling.Exploiting Longer Request Sequences.Structure Algorithms.13. Caching the Uncacheable.
A Note on Implementation.Modified Content and Stale Delivery Avoidance.Cache-Friendly Approaches to Stale Delivery Avoidance.Utilizing Cached Stale Content.Cookied Content.Cache-Friendly Usage of Cookies.Caching Cookied Content.The Semantic Transparency Issue.Expressly Uncacheable Content and Hit Metering.Cache-Friendly Approaches to Hit Metering.Caching Expressly Uncacheable Content.Dynamic Content.Cache-Friendly Design of Dynamic Content.Base Instance Caching.Template Caching.Base-Instance Caching versus Template Caching.Active Proxies.

III. WEB REPLICATION.

14. Basic Mechanisms for Request Distribution.
Content-Blind Request Distribution with Full Replication.Client Redirection.Redirection by a Balancing Switch.Redirection by a Web Site's DNS.Anycast.Content-Blind Request Distribution with Partial Replication.Using Surrogates as Server Replicas.Back-End Distributed File Systems.Content-Aware Request Distribution.Client Redirection by a Java Applet.HTTP Redirection.Redirection by an L7 Switch.Fine-Granularity Domain Names.15. Content Delivery Networks.
Types of CDNs.Delivering Requests to a CDN.Finding Origin Servers.Request Distribution in CDNs.DNS/Balancing Switch Redirection.Two-Level DNS Redirection.Anycast/DNS Redirection.Pitfalls of DNS-Based Request Distribution.Fine-Tuning DNS Request Distribution.Post-DSN Request Distribution by Triangular Communication.Post-DNS Request Distribution with HTTP Redirection and HTML Rewriting.Data Consistency in CDNs.Streaming Content Delivery.Using Multicast for Streaming Content Delivery.Using Application-Level Multicast for Streaming Content Delivery.Constructing a Distribution Tree.Supporting Secure Content Access.SSL Overview.Performance Impact of Supporting SSL in a CDN.Key Management.Content Retrieval from the Origin Server.16. Server Selection.
Metrics.Proximity Metrics.Server Load Metrics.Aggregate Metrics.Internet Mapping Services.Aging of Metrics.Algorithms.Obtaining Passive Measurements.Avoiding Oscillations.Supporting Client Stickiness.Respecting the Affinity of Server Caches.Server Selection with Multiple Metrics.DNS-Based Server Selection.A Typical DNS Server Selection Scheme.Estimating Hidden Load Factors.Why Choose a Server When You Can Have Them All?

IV. FURTHER DIRECTIONS.

17. Adding Value at the Edge.
Content Filtering.Content Transcoding.Watermarking.Custom Usage Reporting.Implementing New Services with an Edge Server API.The ICAP Protocol.Distributed Web Applications.How to Replicate Applications.Where to Replicate Applications.18. Content Distribution Internetworking.
Pros and Cons of CDI.Request Distribution.Content Distribution.Accounting.Glossary.
Bibliography.
Index. 0201615703T12172001

Promotional Information

In Web Caching and Replication, two AT&T Labs experts bring together real-world implementations, the latest IETF and W3C technical standards, and the field's most advanced research, offering systematic guidance for maximizing Web scalability and performance. The authors first introduce key concepts of Web caching and replication.They introduce HTTP's support for caching and replication, and broadly characterize Web behavior, offering practical rules of thumb for optimizing performance. Next, they focus on caching, helping implementers realistically assess proxy caching, deploy proxies in corporate and ISP networks; and utilize cooperative proxy caching. They offer in-depth insights into cache consistency, and caching the "uncacheable" -- including "cookied" and dynamic content. Part III introduces today's best techniques for web replication, offering in-depth coverage of content delivery networks, server selection, content placement, and more. For all networking professionals, IT professionals, and engineers involved with building or optimizing Internet services or equipment; for content delivery specialists and service providers; and for owners of large sites concerned about scalability and performance.

About the Author

Michael Rabinovich is "one of the few research-ers who are qualified to write this book" (Pei Cao, Cisco Systems). As a leading researcher at AT&T Labs, he has closely watched current trends and state-of-the-art standards and commercial products while both battling "real-life" technical problems and participating in new research in Web caching and replication. In addition to numerous publications in the areas of distributed systems, Web performance, and transaction management, he is a co-author of the CRISP distributed proxy cache.

Oliver Spatscheck is a senior technical staff member at AT&T Labs–Research, where he has been actively involved in building AT&T's content distribution network. His research publications span the areas of security, operating systems, and CDNs. He has also been active in the IETF, working most recently on Content Distribution Internetworking. He holds a Ph.D. in computer science from the University of Arizona.

0201615703AB11262001

Table of Contents

Promotional Information

About the Author