Exploring the World of Caching: Boosting Software Performance with a Go Cache Library
When it comes to software development, efficiency is key. One way to improve the performance of your applications is to use caching. Caching plays a central role in optimizing resource-intensive processes, and in this article we’ll explore the world of caching, its benefits, and when and where it can be most useful.
What is a Cache?
A cache is a temporary storage location that holds frequently accessed data, allowing faster access and reduced latency. It typically stores data that is costly or time-consuming to retrieve, making it readily available for future requests. The benefits of caching are:
- Speed and Reduced Latency: Caching significantly improves application speed by reducing the time it takes to retrieve data. When data is cached, it can be fetched quickly from memory rather than requiring a database query, a computationally expensive calculation, an external API request, and so on.
- Scalability: Caching improves the scalability of an application by shifting load from the backend database to the in-memory layer. With cached data readily available, your application can handle more requests and serve more users efficiently.
- Cost-Efficiency: Caching can reduce operational costs associated with database queries or external API requests. By minimizing resource usage, you can save on infrastructure costs.
Use Cases for Caching
Now that we understand the benefits of caching, let’s explore some common use cases where it can be a game-changer.
Caching Web Pages
A classic use case is to cache web pages, both on the server and client side. The idea is to store frequently accessed files to dramatically reduce page load times, resulting in a better user experience.
In server-side caching, web servers store generated web pages or page fragments in a cache. When a user requests a page, the server first checks if a cached version of that page exists. If it does, the server can deliver the cached page directly, bypassing the need to regenerate the entire page. This reduces server load and response time.
In client-side caching, web browsers store resources like HTML, CSS, JavaScript, and images locally. When a user visits a website, the browser checks its cache for these resources before requesting them from the server. If cached, the browser uses the locally stored copies, significantly reducing page load times.
API Response Caching
In API response caching, the responses from API endpoints are cached to reduce the load on the server and improve response times. When a request is made to an API endpoint, the cache is checked first. If the requested data is in the cache and hasn’t expired, the API can return the cached response directly.
This way, when the same request is made multiple times, the cache can serve the stored response, saving time and resources. This is especially useful for APIs serving static or slowly changing data.
Database Query Caching
In this scenario, the cache stores the results of frequently executed database queries. When an application needs specific data, it first checks the cache. If the data is cached and still valid, it’s retrieved from the cache instead of querying the database. If the data isn’t in the cache or has expired, the database query is executed.
This approach can relieve the database server of repetitive and resource-intensive queries. This is critical for applications with a high volume of database transactions.
When Not to Use Caching
Caching is a versatile tool that can significantly improve the performance of your application, but it’s important to carefully consider when and where to use caching to maximize its benefits while avoiding potential pitfalls.
- Real-time data: Caching is not suitable for real-time data where immediate updates are critical. In such cases, caching may result in outdated information being presented to users.
- Limited memory resources: If your system has extremely limited memory resources, caching may not be the best option as it could lead to memory-related problems.
- Multi-instance services: Caching can be challenging in multi-instance or distributed environments because cache consistency becomes an issue. You must ensure that cached data remains synchronized across all instances.
Cache Design and Implementation
To begin the journey of implementing a cache in your Go application, it’s important to first understand the core concepts and design considerations that will shape your cache solution. Let’s start by discussing the basic steps and considerations involved in building an efficient and reliable cache system.
Data Structure
The cache should provide a data structure in which items can be stored and retrieved by a unique key. This key is used to quickly look up the associated data when needed. Lookup should take at most O(log N) time on average, where N is the number of keys; the hash map used below achieves O(1) on average, which comfortably meets that bound.
Each item or key-value pair in the cache should be associated with a fixed expiry time or “time-to-live” (TTL). This TTL represents the duration for which the item will be considered valid and available in the cache. Once the TTL for a specific item expires, it becomes stale, and the cache will no longer serve it as a valid response.
Finally, the cache must manage concurrent access efficiently, without data inconsistencies or performance bottlenecks. Strictly speaking, the implementation below is not lock-free: it uses a read-write mutex so that many goroutines can read the cache concurrently, while writes and eviction take exclusive access. From the caller’s point of view, reading, writing, and eviction behave as atomic operations.
```go
package cache

import (
	"sync"
	"time"
)

// Sentinel TTL values used throughout the library.
const (
	NoExpiration      time.Duration = -1 // item never expires
	DefaultExpiration time.Duration = 0  // use the cache-wide default TTL
)

type Cache struct {
	stop              chan struct{}   // Used to signal the cleanup goroutine to shut down gracefully.
	wg                sync.WaitGroup  // Tracks the cleanup goroutine so Stop() can wait for it.
	mu                sync.RWMutex    // Read-write lock guarding concurrent access.
	items             map[string]item // The key-value storage of the cache.
	defaultExpiration time.Duration   // Cache's default expiration (TTL).
}

type item struct {
	object     any   // The stored value.
	expiration int64 // Expiry timestamp in Unix nanoseconds; 0 means no expiry.
}
```
Initialisation and Termination
A NewCache() function creates a cache object with an optional default expiration and an optional cleanup routine running in the background. The cache stores key-value pairs, and the cleanup routine periodically scans for and removes expired items.
```go
func NewCache(defaultExpiration, cleanupInterval time.Duration) *Cache {
	// Check if the default expiration is non-positive; if so, use NoExpiration.
	if defaultExpiration <= 0 {
		defaultExpiration = NoExpiration
	}
	// Initialize the Cache struct with its fields.
	c := &Cache{
		stop:              make(chan struct{}),   // Channel for stopping the cleanup routine.
		mu:                sync.RWMutex{},        // Read-write mutex for synchronization.
		items:             make(map[string]item), // Map for storing cached items.
		defaultExpiration: defaultExpiration,     // Default expiration time for items.
	}
	// If cleanupInterval is greater than zero, set up a periodic cleanup routine.
	if cleanupInterval > 0 {
		// Increment the wait group to track the cleanup goroutine.
		c.wg.Add(1)
		// Start a new goroutine for cleanup with the specified interval.
		go func(cleanupInterval time.Duration) {
			defer c.wg.Done()          // Decrement the wait group when done.
			c.cleanUp(cleanupInterval) // Perform cache cleanup.
		}(cleanupInterval)
	}
	// Return the initialized Cache object.
	return c
}
```
A Stop() method gracefully terminates the cache, ensuring that any ongoing cleanup activity completes before proceeding.
```go
func (c *Cache) Stop() {
	// Close the 'c.stop' channel to signal the cleanup routine to stop gracefully.
	close(c.stop)
	// Wait for the cleanup routine to finish by waiting on the 'c.wg' WaitGroup.
	c.wg.Wait()
}
```
Access
A Set() function adds a key-value pair to the cache, using a mutex (locking and unlocking) to prevent concurrent-access issues.
```go
func (c *Cache) Set(key string, object any, duration time.Duration) {
	var expiration int64
	// If the provided duration is the default, use the cache's default expiration.
	if duration == DefaultExpiration {
		duration = c.defaultExpiration
	}
	// Calculate the expiration time based on the provided duration.
	if duration > 0 {
		expiration = time.Now().Add(duration).UnixNano()
	}
	c.mu.Lock()         // Acquire a lock to ensure thread safety.
	defer c.mu.Unlock() // Release the lock when the function exits.
	// Create or update the cache entry for the given key.
	c.items[key] = item{
		object:     object,
		expiration: expiration,
	}
}
```
A Get() function retrieves the cached value associated with a key, using a read lock (locking and unlocking) for thread safety. It also checks whether the cache entry is still valid based on its expiration time.
```go
func (c *Cache) Get(key string) (any, bool) {
	c.mu.RLock()         // Acquire a read lock to ensure thread safety.
	defer c.mu.RUnlock() // Release the read lock when the function exits.
	item, found := c.items[key] // Retrieve the cache item associated with the given key.
	// Check if the item exists and is not expired.
	isExpired := item.expiration > 0 && item.expiration <= time.Now().UnixNano()
	if !found || isExpired {
		return nil, false // Missing or expired: report a cache miss.
	}
	return item.object, true // Return the cached object and true, indicating success.
}
```
A Delete() function removes a specific cache entry by its key, again under the mutex, and a Flush() function resets the cache by removing all entries.
```go
func (c *Cache) Delete(key string) {
	c.mu.Lock()          // Acquire a lock to ensure thread safety.
	defer c.mu.Unlock()  // Release the lock when the function exits.
	delete(c.items, key) // Delete the cache entry for the given key.
}

func (c *Cache) Flush() {
	c.mu.Lock()                 // Acquire a lock to ensure thread safety.
	defer c.mu.Unlock()         // Release the lock when the function exits.
	c.items = map[string]item{} // Replace the map with an empty one, effectively clearing the cache.
}
```
Eviction
Finally, a cleanUp() method runs as a background goroutine, periodically removing cache entries whose expiration timestamps have passed. It keeps the cache efficient and prevents stale data from accumulating over time; the ticker provides a controlled, periodic cleanup mechanism.
```go
func (c *Cache) cleanUp(cleanupInterval time.Duration) {
	t := time.NewTicker(cleanupInterval)
	defer t.Stop()
	for {
		select {
		case <-c.stop:
			return
		case <-t.C:
			c.mu.Lock() // Acquire a lock to ensure thread safety during cleanup.
			for key, object := range c.items {
				if object.expiration > 0 && object.expiration <= time.Now().UnixNano() {
					delete(c.items, key) // Delete expired cache entries.
				}
			}
			c.mu.Unlock() // Release the lock after cleanup.
		}
	}
}
```
You can find the full source code for this cache library on my GitHub. Feel free to explore the code, contribute, or use it in your projects.
Conclusion
In summary, implementing a key-value cache in Go can greatly improve the performance and efficiency of your applications. Whether it’s web page caching, API response caching, or reducing database load, a well-designed cache system can be a game changer. Not only does it improve response times, it also reduces resource consumption.
Now that we’ve explored the key concepts, benefits, and technical aspects of caching, you’re equipped to build a cache library tailored to your application’s needs. By harnessing the power of caching, you can increase the speed, responsiveness, and scalability of your software, providing a smoother and more satisfying user experience.