๐Ÿ” CVE Alert

CVE-2026-12243

HIGH 7.5

Path Traversal via Percent-Encoding in nltk.data.find() and nltk.data.load()

CVSS Score
7.5
EPSS Score
0.0%
EPSS Percentile
0th

NLTK version 3.9.4 is vulnerable to a path traversal attack due to an incomplete fix for GitHub Issue #3504. The `_UNSAFE_NO_PROTOCOL_RE` regex in `nltk/data.py` checks for literal `../` sequences but fails to account for percent-encoded traversal sequences such as `..%2f`. The `url2pathname()` function decodes these sequences after the validation step, allowing an attacker to bypass the protection. This vulnerability enables an attacker to read arbitrary files accessible to the Python process by controlling the resource name parameter passed to `nltk.data.load()` or `nltk.data.find()`. The issue affects applications that rely on NLTK for resource loading, including NLP web applications, Jupyter notebooks, and CLI tools. The default `pathsec.ENFORCE=False` setting exacerbates the impact by not blocking the file read at the `open()` stage.

CWE CWE-22
Vendor nltk
Product nltk/nltk
Published Jun 30, 2026
Stay Ahead of the Next One

Get instant alerts for nltk nltk/nltk

Be the first to know when new high vulnerabilities affecting nltk nltk/nltk are published โ€” delivered to Slack, Telegram or Discord.

Get Free Alerts โ†’ Free ยท No credit card ยท 60 sec setup

CVSS v3 Breakdown

CVSS:3.0/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:N/A:N

Affected Versions

nltk / nltk/nltk
unspecified โ‰ค latest

References

NVD โ†— CVE.org โ†— EPSS Data โ†—
huntr.com: https://huntr.com/bounties/39aa9354-54ca-4e77-96da-580eb1fe6ed1