🔗 Feature: Disk Daemon (diskd) helper
- Status: Complete.
- Version: 2.4
🔗 What is DISKD?
DISKD refers to some features in Squid-2.4 and later to improve Disk I/O performance. The basic idea is that each cache_dir has its own diskd child process. The diskd process performs all disk I/O operations (open, close, read, write, unlink) for the cache_dir. Message queues are used to send requests and responses between the Squid and diskd processes. Shared memory is used for chunks of data to be read and written.
🔗 Does it perform better?
Yes. We benchmarked Squid-2.4 with DISKD at the Second IRCache Bake-Off. At the bakeoff, we got 160 req/sec with diskd. Without diskd, we’d have gotten about 40 req/sec.
On modern Linux systems the Disk Daemon has been trumped by extremely fast AUFS. diskd is still recommended for BSD variants. However, we may have found an implementation bug in squid which was hobbling AUFS on BSD.
🔗 How do I use it?
You need to run Squid version 2.4 or later. Your operating system must support message queues, and shared memory.
To configure Squid for DISKD, use the –enable-storeio option:
% ./configure --enable-storeio=diskd,ufs
🔗 FATAL: Unknown cache_dir type ‘diskd’
You didn’t put diskd in the list of storeio modules as described above. You need to run configure and and recompile Squid.
🔗 If I use DISKD, do I have to wipe out my current cache?
No. Diskd uses the same storage scheme as the standard “UFS” type. It only changes how I/O is performed.
🔗 How do I configure message queues?
Most Unix operating systems have message queue support by default. One way to check is to see if you have an ipcs command.
However, you will likely need to increase the message queue parameters for Squid. Message queue implementations normally have the following parameters:
- MSGMNB: Maximum number of bytes per message queue.
- MSGMNI: Maximum number of message queue identifiers (system wide).
- MSGSEG: Maximum number of message segments per queue.
- MSGSSZ: Size of a message segment.
- MSGTQL: Maximum number of messages (system wide).
- MSGMAX: Maximum size of a whole message. On some systems you may need to increase this limit. On other systems, you may not be able to change it.
The messages between Squid and diskd are 32 bytes for 32-bit CPUs and 40 bytes for 64-bit CPUs. Thus, MSGSSZ should be 32 or greater. You may want to set it to a larger value, just to be safe.
We’ll have two queues for each cache_dir – one in each direction. So, MSGMNI needs to be at least two times the number of cache_dirs. I’ve found that 75 messages per queue is about the limit of decent performance. If each diskd message consists of just one segment (depending on your value of MSGSSZ), then MSGSEG should be greater than 75.MSGMNB and MSGTQL affect how many messages can be in the queues at one time. Diskd messages shouldn’t be more than 40 bytes, but let’s use 64 bytes to be safe. MSGMNB should be at least 64*75. I recommend rounding up to the nearest power of two, or 8192.MSGTQL should be at least 75 times the number ofcache_dirs that you’ll have.
🔗 FreeBSD
Your kernel must have
options SYSVMSG
You can set the parameters in the kernel as follows. This is just an example. Make sure the values are appropriate for your system:
options MSGMNB=8192 # max # of bytes in a queue
options MSGMNI=40 # number of message queue identifiers
options MSGSEG=512 # number of message segments per queue
options MSGSSZ=64 # size of a message segment
options MSGTQL=2048 # max messages in system
🔗 OpenBSD
You can set the parameters in the kernel as follows. This is just an example. Make sure the values are appropriate for your system:
option MSGMNB=16384 # max characters per message queue
option MSGMNI=40 # max number of message queue identifiers
option MSGSEG=2048 # max number of message segments in the system
option MSGSSZ=64 # size of a message segment (Must be 2^N)
option MSGTQL=1024 # max amount of messages in the system
🔗 How do I configure shared memory?
Shared memory uses a set of parameters similar to the ones for message queues. The Squid DISKD implementation uses one shared memory area for each cache_dir. Each shared memory area is about 800 kilobytes in size. You may need to modify your system’s shared memory parameters:
- SHMSEG Maximum number of shared memory segments per process.
- SHMMNI Maximum number of shared memory segments for the whole system.
- SHMMAX Largest shared memory segment size allowed.
- SHMALL Total amount of shared memory that can be used.
For Squid and DISKD, SHMSEG and SHMMNI must be greater than or equal to the number of cache_dir’s that you have. SHMMAX must be at least 800 kilobytes. SHMALL must be at least 800 kilobytes multiplied by the number of cache_dir’s.
Note that some operating systems express SHMALL in pages, rather than bytes, so be sure to divide the number of bytes by the page size if necessary. Use the pagesize command to determine your system’s page size, or use 4096 as a reasonable guess.
🔗 FreeBSD
Your kernel must have
options SYSVSHM
You can set the parameters in the kernel as follows. This is just an example. Make sure the values are appropriate for your system:
options SHMSEG=16 # max shared mem id's per process
options SHMMNI=32 # max shared mem id's per system
options SHMMAX=2097152 # max shared memory segment size (bytes)
options SHMALL=4096 # max amount of shared memory (pages)
🔗 OpenBSD
OpenBSD is similar to FreeBSD, except you must use option instead of options, and SHMMAX is in pages instead of bytes:
option SHMSEG=16 # max shared mem id's per process
option SHMMNI=32 # max shared mem id's per system
option SHMMAX=2048 # max shared memory segment size (pages)
option SHMALL=4096 # max amount of shared memory (pages)
🔗 Sometimes shared memory and message queues aren’t released when Squid exits.
Yes, this is a little problem sometimes. Seems like the operating system gets confused and doesn’t always release shared memory and message queue resources when processes exit, especially if they exit abnormally. To fix it you can “manually” clear the resources with the ipcs command. Add this command into your RunCache or squid_start script:
ipcs | awk '/squid/ {printf "ipcrm -%s %s\n", $1, $2}' | /bin/sh
🔗 What are the Q1 and Q2 parameters?
In the source code, these are called magic1 and magic2. These numbers refer to the number of oustanding requests on a message queue. They are specified on the cache_dir option line, after the L1 and L2 directories:
cache_dir diskd /cache1 1024 16 256 Q1=72 Q2=64
If there are more than Q1 messages outstanding, then Squid will intentionally fail to open disk files for reading and writing. This is a load-shedding mechanism. If your cache gets really really busy and the disks can not keep up, Squid bypasses the disks until the load goes down again.
If there are more than Q2 messages outstanding, then the main Squid process “blocks” for a little bit until the diskd process services some of the messages and sends back some replies.
Reasonable Q1 and Q2 values are 64 and 72. If you would rather have good hit ratio and bad response time, set Q1 > Q2. Otherwise, if you would rather have good response time and bad hit ratio, set Q1 < Q2.
Categories: Feature
Navigation: Site Search, Site Pages, Categories, 🔼 go up