Blog post: How to Properly Secure Your Valkey Deployment #389


Open

allenheltondev wants to merge 3 commits into valkey-io:main from allenheltondev:main

+108 −0

allenheltondev commented Oct 10, 2025

Description

Blog post: How to Properly Secure Your Valkey Deployment

Issues Resolved

Check List

Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the BSD-3-Clause License.

allenheltondev added 3 commits

October 7, 2025 16:50


c548c72 vulnerability revealing bigger problem blog

127121d feedback updates

d3db24b Rewrite what I had into a purposeful security blog

allenheltondev requested review from madolson and stockholmux as code owners

October 10, 2025 18:38

Author

allenheltondev commented Oct 21, 2025

Bump @madolson or @stockholmux for review

Member

madolson commented Nov 7, 2025

On it now! Sorry for the delay, some work fires recently :)

madolson reviewed

Member

madolson left a comment

I like the structure. There are some nits and misc recommendations, but I think it's close.

content/blog/2025-10-15-properly-secure-your-valkey-deployment.md

+ featured_image = "/assets/media/featured/random-06.webp"
+ +++
+ Most of the production incidents I’ve helped debug started with misconfigurations rather than zero-days or sophisticated exploits.

Member

madolson Nov 7, 2025

Suggested change

      
- Most of the production incidents I’ve helped debug started with misconfigurations rather than zero-days or sophisticated exploits.
+ Most of the production security incidents I’ve helped debug started with misconfigurations rather than zero-days or sophisticated exploits.

content/blog/2025-10-15-properly-secure-your-valkey-deployment.md


		Security misconfiguration ranks as A05 in the [OWASP Top 10:2021](https://owasp.org/Top10/A05_2021-Security_Misconfiguration/), with 90% of applications tested showing some form of misconfiguration. That's staggering. And when it comes to infrastructure like Valkey, the stakes are even higher - your cache often sits at the heart of your application, touching every request.

		Engineers really care about security - but it is easy to overlook some crucial settings. This is especially true in the cloud, where everything moves really fast. You spin up a Valkey instance inside your VPC, it works, and you move on to the next problem. VPC can lock down your network to the outsiders - but I often see multiple teams being able to access the same VPC. This leaves systems vulnerable to insider threads as well as well intentioned people or microservices that just happen to have a bad day. But using default configurations or enabling unnecessary features can make systems [easy targets for attackers](https://socradar.io/redis-redishell-vulnerability-cve-2025-49844/).

Member

madolson Nov 7, 2025

Suggested change

      
- Engineers really care about security - but it is easy to overlook some crucial settings. This is especially true in the cloud, where everything moves really fast. You spin up a Valkey instance inside your VPC, it works, and you move on to the next problem. VPC can lock down your network to the outsiders  - but I often see multiple teams being able to access the same VPC. This leaves systems vulnerable to insider threads as well as well intentioned people or microservices that just happen to have a bad day. But using default configurations or enabling unnecessary features can make systems [easy targets for attackers](https://socradar.io/redis-redishell-vulnerability-cve-2025-49844/).
+ Engineers really care about security - but it is easy to overlook some crucial settings. This is especially true in the cloud, where everything moves really fast. You spin up a Valkey instance inside your VPC, it works, and you move on to the next problem. VPC can lock down your network to the outsiders  - but I often see multiple teams being able to access the same VPC. This leaves systems vulnerable to insider threats as well as well intentioned people or microservices that just happen to have a bad day. But using default configurations or enabling unnecessary features can make systems [easy targets for attackers](https://socradar.io/redis-redishell-vulnerability-cve-2025-49844/).

content/blog/2025-10-15-properly-secure-your-valkey-deployment.md


		This is where putting your Valkey node inside a VPC is necessary - but not sufficient. Security groups help reinforce access limitation to make sure that only services and people who are intended to access the cluster can do so. Your CI runners probably don't need direct cache access. Each service should have just the access it needs.

		Modern infrastructure also handles TLS seamlessly. While it is unlikely that an attacker is sniffing your packets on your cloud network, it is best practice to have encryption in transit - even within your own network.

Member

madolson Nov 7, 2025

Maybe also worth adding that modern hardware handles TLS handshakes and traffic much better, so the performance impact is much smaller than it used to be. Redis only added TLS support in 2020, so some people might not know about that.
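
To make the encryption-in-transit point concrete, here is a minimal client-side sketch using Python's standard `ssl` module. It assumes the server certificate chains to a CA in the system trust store; with a private CA you would load it with `load_verify_locations` instead. The helper name is hypothetical, and the resulting context would be handed to whatever TLS-capable Valkey client you use.

```python
import ssl


def make_valkey_tls_context() -> ssl.SSLContext:
    """Build a client-side TLS context with certificate verification enabled.

    Hypothetical helper for illustration; it is not part of any Valkey client.
    """
    # create_default_context() enables certificate and hostname verification.
    context = ssl.create_default_context(ssl.Purpose.SERVER_AUTH)
    # Refuse protocol versions older than TLS 1.2.
    context.minimum_version = ssl.TLSVersion.TLSv1_2
    return context


context = make_valkey_tls_context()
print(context.verify_mode == ssl.CERT_REQUIRED, context.check_hostname)
```

The point of the sketch is that verification stays on even inside a private network: an encrypted connection to an unverified peer only protects against passive sniffing.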

content/blog/2025-10-15-properly-secure-your-valkey-deployment.md


		Authentication adds a critical layer of resiliency. The authentication layer protects you if your firewall or other protections fail, unauthenticated clients still can't access your instance.

		Valkey supports [two authentication methods](https://valkey.io/topics/security/#authentication): the newer ACL system (Access Control Lists) and the legacy `requirepass`. ACLs give you more flexibility by allowing you to create users with fine-grained permissions tailored to what each service actually needs.

Member

madolson Nov 7, 2025

Might also consider adding https://valkey.io/topics/ldap/ which avoids having to manage a separate credential system.

content/blog/2025-10-15-properly-secure-your-valkey-deployment.md

+              ACL SETUSER admin on >verystrongpassword ~* +@all
+              ```
+              This principle of least privilege means that even if credentials are compromised, an attacker is limited to only the operations the user can perform. A read-only monitoring account can't flush your entire cache or modify configurations.

Member

madolson Nov 7, 2025

Maybe provide a suggested set of user permissions. I typically suggest:

ACL SETUSER application on >password +@all -@dangerous -@scripting

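That is a good base for applications, since most issues come from scripting or dangerous commands. Note that it still needs a key pattern (for example `~*`), because a new ACL user starts with no key access.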
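
As a rough illustration of that baseline, the sketch below assembles the `ACL SETUSER` arguments in Python so the same rule set can be generated per service. The helper name, the `app:*` key prefix, and the credentials are all placeholders, not anything from the post. A key pattern is passed explicitly because a newly created ACL user has access to no keys.

```python
def build_acl_setuser(name, password, key_patterns, allow, deny):
    """Return the argument list for an ACL SETUSER command.

    Hypothetical helper: it only assembles rule tokens and never talks to a server.
    """
    args = ["ACL", "SETUSER", name, "on", f">{password}"]
    args += [f"~{pattern}" for pattern in key_patterns]
    # Rules are applied left to right, so grants come before removals.
    args += [f"+{rule}" for rule in allow]
    args += [f"-{rule}" for rule in deny]
    return args


command = build_acl_setuser(
    name="application",
    password="password",  # placeholder, use a generated secret in practice
    key_patterns=["app:*"],
    allow=["@all"],
    deny=["@dangerous", "@scripting"],
)
print(" ".join(command))
# ACL SETUSER application on >password ~app:* +@all -@dangerous -@scripting
```

Scoping the key pattern to a per-service prefix keeps a compromised credential from reading another team's keys in a shared instance.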

content/blog/2025-10-15-properly-secure-your-valkey-deployment.md


		Once Valkey is running, your operational posture determines how quickly you can detect and contain issues. Enable logging so you can see what's happening. Monitor for unusual patterns like sudden spikes in command execution, connections from unexpected sources, or commands that shouldn't be running in your environment.

		Set resource limits in your configuration. Poorly written operations or runaway commands can impact your cache's availability. `maxmemory`, `timeout`, and `tcp-keepalive` settings aren't just performance tuning - they help protect against resource exhaustion.

Member

madolson Nov 7, 2025

Suggested change

      
- Set resource limits in your configuration. Poorly written operations or runaway commands can impact your cache's availability. `maxmemory`, `timeout`, and `tcp-keepalive` settings aren't just performance tuning - they help protect against resource exhaustion.
+ Set resource limits in your configuration. Poorly written operations or runaway commands can impact your cache's availability. `maxmemory`, `timeout`, and `tcp-keepalive` settings aren't just performance tuning - they help protect against resource exhaustion.

I don't think tcp-keepalive does much for resource exhaustion that timeout won't also cover. It's more for unreliable networks.

I also generally recommend not setting `timeout` anyway. There is a special timeout for unauthenticated users, and the regular `timeout` mostly just causes unnecessary reconnects for well-behaved applications. `maxmemory` makes sense though.
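
If the post keeps only the `maxmemory` advice, a small audit sketch could show readers how to check for it. The Python below parses `valkey.conf`-style text and flags a missing or unlimited `maxmemory` (the default of `0` means no limit) and an unset `maxmemory-policy`. The function names and sample configuration are made up for illustration, and flagging an absent policy is a judgment call meant to make the eviction choice explicit.

```python
def parse_valkey_conf(text):
    """Parse 'directive value' lines from valkey.conf-style text into a dict."""
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and comments
        parts = line.split(None, 1)
        settings[parts[0].lower()] = parts[1].strip() if len(parts) > 1 else ""
    return settings


def missing_memory_limits(settings):
    """List memory-related directives that are absent or left unlimited."""
    problems = []
    if settings.get("maxmemory", "0") == "0":
        problems.append("maxmemory")
    if "maxmemory-policy" not in settings:
        problems.append("maxmemory-policy")
    return problems


# Hypothetical sample configuration, not taken from the post.
SAMPLE_CONF = """
# hypothetical sample configuration
port 6379
maxmemory 2gb
"""

print(missing_memory_limits(parse_valkey_conf(SAMPLE_CONF)))
# ['maxmemory-policy']
```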

content/blog/2025-10-15-properly-secure-your-valkey-deployment.md


		Set resource limits in your configuration. Poorly written operations or runaway commands can impact your cache's availability. `maxmemory`, `timeout`, and `tcp-keepalive` settings aren't just performance tuning - they help protect against resource exhaustion.

		Observability is part of security! Logs and metrics turn silent failures into visible signals, and visibility is what buys you time to respond before small issues become incidents.

Member

madolson Nov 7, 2025

I would suggest adding `acl_access_denied_auth` here; it counts authentication failures.
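
A minimal sketch of how that counter could be watched, assuming you already collect the text returned by `INFO stats` at regular intervals. The Python below parses two samples and reports how many authentication failures happened in between. The function names and sample payloads are hypothetical, and the alerting threshold is left to the reader.

```python
def parse_info(text):
    """Parse INFO-style 'key:value' lines into a dict, skipping section headers."""
    fields = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition(":")
        fields[key] = value
    return fields


def auth_failures_since(previous, current):
    """Return how many authentication failures occurred between two INFO samples."""
    before = int(previous.get("acl_access_denied_auth", 0))
    after = int(current.get("acl_access_denied_auth", 0))
    # The counter starts over after a restart, so never report a negative delta.
    return max(after - before, 0)


# Hypothetical samples shaped like INFO stats output.
EARLIER = "# Stats\r\nacl_access_denied_auth:3\r\n"
LATER = "# Stats\r\nacl_access_denied_auth:10\r\n"

print(auth_failures_since(parse_info(EARLIER), parse_info(LATER)))
# 7
```

Tracking the delta rather than the raw value is what turns the counter into a signal: a sudden jump between samples is worth an alert, while a large but flat total is not.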


Labels

None yet