This is a high performance Golang HTTP middleware for server-side application layer caching, ideal for REST APIs.
It is simple, super fast, thread safe and gives the possibility to choose the adapter (memory, Redis, DynamoDB etc).
The memory adapter minimizes GC overhead to near zero and supports some options of caching algorithms (LRU, MRU, LFU, MFU). This way, it is able to store plenty of gigabytes of responses, keeping great performance and being free of leaks.
go get github.com/victorspringer/http-cache
This is an example of use with the memory adapter:
package main
import (
"log"
"net/http"
"time"
"github.com/victorspringer/http-cache"
"github.com/victorspringer/http-cache/adapter/memory"
)
func example(w http.ResponseWriter, r *http.Request) {
w.Write([]byte("Ok"))
}
func main() {
memcached, err := memory.NewAdapter(
memory.AdapterWithAlgorithm(memory.LRU),
memory.AdapterWithCapacity(10000000),
)
if err != nil {
log.Fatal(err)
}
cacheClient, err := cache.NewClient(
cache.ClientWithAdapter(memcached),
cache.ClientWithTTL(10 * time.Minute),
cache.ClientWithRefreshKey("opn"),
)
if err != nil {
log.Fatal(err)
}
handler := http.HandlerFunc(example)
http.Handle("/", cacheClient.Middleware(handler))
http.ListenAndServe(":8080", nil)
}Example of Client initialization with Redis adapter:
import (
"github.com/victorspringer/http-cache"
"github.com/victorspringer/http-cache/adapter/redis"
)
...
ringOpt := &redis.RingOptions{
Addrs: map[string]string{
"server": ":6379",
},
}
cacheClient, err := cache.NewClient(
cache.ClientWithAdapter(redis.NewAdapter(ringOpt)),
cache.ClientWithTTL(10 * time.Minute),
cache.ClientWithRefreshKey("opn"),
)
...Use Drop to release the cached response that matches a request. The same cache key rules used by the middleware are applied, including normalized query params, POST bodies and configured vary headers.
req, err := http.NewRequest(http.MethodGet, "https://example.com/products?page=1", nil)
if err != nil {
log.Fatal(err)
}
if err := cacheClient.Drop(req); err != nil {
log.Fatal(err)
}PURGE support is opt-in so existing applications that already handle PURGE keep working as before.
Authentication is your responsibility. This middleware does not authenticate
PURGErequests; any caller that can reach the endpoint can flush cache entries. The same applies toClientWithRefreshKey— any query string carrying the configured key will release the matching entry. Place an auth middleware (basic auth, signed token, source-IP allowlist, etc.) in front of the cache middleware before exposing either feature on a public surface.
cacheClient, err := cache.NewClient(
cache.ClientWithAdapter(memcached),
cache.ClientWithTTL(10 * time.Minute),
cache.ClientWithPurge(),
)When enabled, a matching PURGE request releases the cached response and returns 204 No Content. For endpoints cached by POST body, send the PURGE request with the same body so the cache keys match.
Use ClientWithObserver to receive cache middleware events. The event includes the request, cache key, event type and status code when available.
cacheClient, err := cache.NewClient(
cache.ClientWithAdapter(memcached),
cache.ClientWithTTL(10 * time.Minute),
cache.ClientWithObserver(func(event cache.CacheEvent) {
log.Printf("cache event=%s key=%d status=%d path=%s",
event.Type,
event.Key,
event.StatusCode,
event.Request.URL.Path,
)
}),
)Available event types are hit, miss, stale, refresh, store and purge.
ClientWithMethodsenables caching forGETand/orPOSTrequests.ClientWithVaryHeadersincludes selected request headers in the cache key.ClientWithStatusCodeFiltercontrols which response status codes can be cached.ClientWithSkipCacheResponseHeaderskips storage when a response includes a configured header.ClientWithSkipCacheURIPathRegexskips lookup and storage for matching URL paths.ClientWithExpiresHeaderwrites the cached response expiration as anExpiresheader.ClientWithMaxBodySize(n)caps the response body bytes the middleware will buffer and cache. Responses larger thannare still streamed to the client untouched, but their buffered copy is dropped and the entry is not stored. Recommended for any endpoint that can emit large payloads (downloads, streaming responses).
ClientWithSingleflight coalesces concurrent misses for the same cache key so the origin handler runs only once per stampede. All concurrent callers receive the same response. Disabled by default — opt in if your origin is expensive enough that an N-way concurrent miss is a real concern.
cacheClient, err := cache.NewClient(
cache.ClientWithAdapter(memcached),
cache.ClientWithTTL(10 * time.Minute),
cache.ClientWithSingleflight(),
)ClientWithRespectCacheControl makes the middleware honor a useful subset of RFC 7234:
- Request
Cache-Control: no-store→ bypass the cache entirely (no lookup, no store). - Request
Cache-Control: no-cache→ skip the lookup; the handler runs against the origin and the response is still stored. - Response
Cache-Control: no-store,no-cache, orprivate→ do not store the response (shared-cache semantics). - Response
Cache-Control: s-maxage=N/max-age=N→ override the client default TTL for this single response (s-maxagewins).
Unknown directives are intentionally ignored, so any extension your application emits keeps its existing behavior.
ClientWithStaleWhileRevalidate(window) implements RFC 5861 stale-while-revalidate. An expired entry whose age is no greater than window is served from cache immediately while a single background goroutine refreshes it. Concurrent stale hits coalesce to one origin call. The refresh outlives the triggering request, so a client disconnect does not abort the refill.
cacheClient, err := cache.NewClient(
cache.ClientWithAdapter(memcached),
cache.ClientWithTTL(1 * time.Minute),
cache.ClientWithStaleWhileRevalidate(10 * time.Second),
)The benchmarks were based on allegro/bigcache tests and used to compare it with the http-cache memory adapter.
The tests were run using an Intel i5-2410M with 8GB RAM on Arch Linux 64bits.
The results are shown below:
cd adapter/memory/benchmark
go test -bench=. -benchtime=10s ./... -timeout 30m
BenchmarkHTTPCacheMamoryAdapterSet-4 5000000 343 ns/op 172 B/op 1 allocs/op
BenchmarkBigCacheSet-4 3000000 507 ns/op 535 B/op 1 allocs/op
BenchmarkHTTPCacheMamoryAdapterGet-4 20000000 146 ns/op 0 B/op 0 allocs/op
BenchmarkBigCacheGet-4 3000000 343 ns/op 120 B/op 3 allocs/op
BenchmarkHTTPCacheMamoryAdapterSetParallel-4 10000000 223 ns/op 172 B/op 1 allocs/op
BenchmarkBigCacheSetParallel-4 10000000 291 ns/op 661 B/op 1 allocs/op
BenchmarkHTTPCacheMemoryAdapterGetParallel-4 50000000 56.1 ns/op 0 B/op 0 allocs/op
BenchmarkBigCacheGetParallel-4 10000000 163 ns/op 120 B/op 3 allocs/ophttp-cache writes are slightly faster and reads are much more faster.
cache=http-cache go run benchmark_gc_overhead.go
Number of entries: 20000000
GC pause for http-cache memory adapter: 2.445617ms
cache=bigcache go run benchmark_gc_overhead.go
Number of entries: 20000000
GC pause for bigcache: 7.43339mshttp-cache memory adapter takes way less GC pause time, that means smaller GC overhead.
- Develop gRPC middleware
- Develop Badger adapter
- Develop DynamoDB adapter
- Develop MongoDB adapter
http-cache is released under the MIT License.