Kubernetes egress control with squid proxy

(interlaye.red)

27 points | by fsmunoz 2 hours ago

8 comments

  • merpkz 54 minutes ago
    You don't need a sidecar to stream logs of squid, that's anti-pattern, instead just tell squid to write logs to /dev/stdout, like this:

      logfile_rotate 0
      cache_log stdio:/dev/stdout
      access_log stdio:/dev/stdout
      cache_store_log stdio:/dev/stdout
    
    Running squid in container is a bit tricky, since it is indeed an ancient piece of software, but I have managed to run it successfully before with squid configuration like this:

      max_filedescriptors 1048576
      pid_filename /dev/shm/squid.pid
      cache_effective_user squid
      cache_effective_group squid
    
    and deployment has these set, - UID 31 is squid user inside of container

      securityContext:
      runAsUser: 31
      runAsGroup: 31
      fsGroup: 31
      command: ["sh","-c","squid -z && sleep 3s; squid -N"]
    • fsmunoz 19 minutes ago
      That's a more elegant approach. I usually just plow through obstacles, and the end result is not always ideal -- I like your approach better than the sidecar, I guess that I was using sidecars for other things and it sort of influenced my approach.

      I'll try it your suggestions out and update the article, and thank you for your comment, already made sharing this worth it.

      • merpkz 2 minutes ago
        Don't even mention it, I have never used NetworkPolicy before, but now it seems like exactly the thing I am missing on my clusters to limit the blast radius if anything gets owned. It's quite incredible the amount of nftables firewall rules the k3s daemon just created for that example policy in your blog, now I am in rabbit hole trying to figure out how this all actually works under the hood. Thanks for this writeup!
  • baobun 57 minutes ago
    Not just squid but mostly any http proxy can be run in forward mode if you want.

    Caddys "magic TLS" can be neat for this if you actually do want to dynamically intercept those https connections in an easy way. It's a use-case where Caddy really shines. You can go nuts trying to configure that cleanly in squid. The docs (perhaps intentionally) make you work for the hidden knowledge of these dark arts. You also get modernities like builtin http2, http3, etc.

    Nobody else bothered by squids very lengthy restart time or have I just never configured it properly?

    (Not to dunk on squid, it's otherwise mostly great. Especially for its caching features)

    • fsmunoz 2 minutes ago
      I've used Caddy for some of my projects (e.g. https://github.com/fsmunoz/parlamentodb/blob/54e0b252485905e... ), but not for this intercept approach you mentioned, I will give it a look!

      I'm not bothered by restart times but that's mostly because that has never been a priority... but one thing I have half-done is a controller that gathers per-namespace configs, and with that reload times will become more of an issue.

      Part of the reason I chose Squid here was precisely because I found it interesting to reuse something that was such a staple of web architecture patterns.

  • e-Minguez 13 minutes ago
    This is great! The only downside is that the app needs to understand proxies.
  • crimsonnoodle58 52 minutes ago
    We use squid for egress control on Kubernetes and have also written a controller that runs in a sidecar container next to squid that monitors for custom CRD's, such as a whitelists.

    The controller then updates squid.conf and reloads squid. This allows pods/namespaces to define their own whitelists.

    The great thing about using squid and disabling DNS is you can stop DNS and HTTP exfil, but still allow certain websites to be accessible.

  • btreecat 1 hour ago
    I like this approach!

    I am struggling to lock down a pod in my home cluster to allow local connections to it's web UI but force all other connections through a VPN client. I'm going to investigate if I could use squid for this.

    My next approach is going to involve using a sidecar.

    One heads up to the author, the text based charts didn't render well on FF mobile. Text is meant to reflow based on screen size, typeface etc. I feel this is a great case for using a drawing/image instead.

    • baobun 30 minutes ago
      Using an http proxy like squid (or apache/haproxy/caddy/envoy/trafficserver/freenginx) does sound like what you should do next.

      If you need the pod to do outbound connections as well as receive incoming traffic, usually that would be two different proxies (forward and reverse, respectively). Unless you do some fancy p2p service mesh.

    • brynx97 1 hour ago
      I had challenges with split-DNS in my homelab k3s cluster trying to do this. I ended up just putting the apps in docker-compose on a VM that has static routes for my local homelab networks. I looked at tailscale to solve this since it has a kubernetes operator, but tailscale doesn't fit my use cases or work well with all of my devices.
      • btreecat 54 minutes ago
        > I had challenges with split-DNS in my homelab k3s cluster trying to do this. I ended up just putting the apps in docker-compose on a VM that has static routes for my local homelab networks. I looked at tailscale to solve this since it has a kubernetes operator, but tailscale doesn't fit my use cases or work well with all of my devices.

        I don't need tails scale for this, seems like overkill.

        I would like to better understand why my combination of marked packets and SOCK5 proxy are not fully working for certain UDP traffic. I also need to investigate if disabling ipv6 will help.

        Using a VM or docker compose when I have k3s feels like admitting defeat with out understanding why.

        • baobun 21 minutes ago
          > I would like to better understand why my combination of marked packets and SOCK5 proxy are not fully working for certain UDP traffic

          I think UDP support for SOCKS5 proxies and clients is very spotty, especially beyond DNS. Probably some bugs out there. That might go for UDP in more or less esoteric container networking setups too...

          If everything else fails, I've had the least hassle with socat, as well as just chucking workloads in full vm (if in container with --network=host) and using ip routes and policies.

  • oldsj 57 minutes ago
    I’ve been working on running agents (Claude agent sdk) on k8s this looks great to control their egress
  • m1keil 1 hour ago
    Pragmatic and practical. I learned something, thanks.