GitHub Blog

Making secret scanning more trustworthy: Reducing false positives at scale

GitHub collaborated with Microsoft's Agents Offense team to add contextual reasoning to secret scanning verification. By extracting focused, high-signal context from code rather than entire files, the system better distinguishes real secrets from false alarms. This approach achieved a 75.76% reduction in false positives on customer-confirmed alerts while maintaining detection coverage and scalability, improving developer trust and triage efficiency.

Statesummarized

Snapshots1

AI Outputs2

Open issues0

validated summary

English

Making secret scanning more trustworthy: Reducing false positives at scale

GitHub collaborated with Microsoft to add contextual reasoning to secret scanning verification.
Focused context extraction from code improves verification without analyzing entire files or repositories.
The new approach achieved a 75.76% reduction in false positives on customer-confirmed alerts.
Better context helps distinguish real secrets from lookalikes like UUIDs or opaque strings.
The method maintains high detection coverage and scalability across large codebases.

Chinese

让秘密扫描更可信：大规模减少误报

GitHub与微软的Agents Offense团队合作，为秘密扫描验证增加上下文推理。通过从代码中提取集中、高信号的信息，而非整个文件，系统更好地区分真实秘密和误报。该方法在客户确认的误报中实现了75.76%的减少，同时保持检测覆盖率和可扩展性，提高了开发者的信任和分类效率。

GitHub与微软合作，为秘密扫描验证增加上下文推理。
从代码中提取集中上下文，无需分析整个文件即可改进验证。
新方法在客户确认的误报中实现了75.76%的减少。
更好的上下文有助于区分真实秘密与UUID等类似物。
该方法保持高检测覆盖率和跨大型代码库的可扩展性。

Secret Scanning / false positives / AI / LLM / security / GitHub / Microsoft / contextual reasoning / verification / developer trust

Full Article

Mariko Wakabayashi · @mwakaba2

June 11, 2026|

5 minutes

Share:Secret scanning plays a critical role in protecting developers and organizations. It helps catch exposed credentials early and prevents small mistakes from turning into real incidents.At GitHub’s scale, even small inefficiencies create real friction. Too many false positives make alerts harder to trust.When alerts feel noisy, developers spend more time triaging and less time fixing real issues. Over time, this slows down remediation and reduces confidence in the system.To address this challenge, GitHub collaborated with Microsoft Security & AI’s Agents Offense team to bring more contextual reasoning into GitHub’s secret scanning verification. The collaboration applied the verification approach from Agentic Secret Finder, a broader detection and verification system developed to understand potential secrets in context, not just whether they match a secret-like pattern. This helped GitHub explore ways to reduce low-value alerts while preserving the coverage you expect from secret scanning.Secret scanning at GitHub todayGitHub secret scanning combines pattern-based detection with AI-based detection to identify potential secrets. Pattern-based detection catches known secret formats, such as partner patterns for tokens and API keys. AI-powered generic secret detection expands coverage to unstructured secrets like passwords that don’t match a known provider pattern.GitHub already has industry-leading precision for provider-pattern secret detection at massive scale, processing billions of pushes and protecting tens of millions of developers across millions of repositories.As GitHub expanded into AI-powered secret detection, the next challenge was bringing the precision of AI-detected secrets closer to the same high standard as provider-pattern detections. This collaboration focused on combining GitHub’s large-scale detection pipeline with LLM-based contextual verification to improve alert quality and developer trust.Our approach: Make secret scanning alerts trustworthySecret scanning is most useful when you can quickly tell which alerts need action.GitHub already has safeguards to reduce noise, but some secret-like values need more context to determine whether they represent a real exposure. To make those alerts easier to trust, we added more reasoning to the verification step.By looking at how a detected value appears in code, the system can better separate real exposures from values that only look sensitive. This helps you spend less time investigating low-value alerts and more time fixing the issues that matter.Where this fits in the pipelineThis approach builds directly on the existing system. Detection continues to generate candidates, and the verification step evaluates them. More context-awareness makes this system better at distinguishing real secrets from noise.The result is higher precision without changing upstream detection logic or reducing coverage.How it worksA key challenge in verification is deciding what context to provide.A small snippet of code is often not enough to determine whether something is a real secret. At the same time, passing entire files or repositories introduces too much noise and increases cost and latency.Instead of giving more context, we’re giving better context.Rather than send large amounts of code, we extract a small set of high-signal information that helps explain how the value is used. For example, we look for cases where a value is assigned to a variable and later passed into an API request, authentication header, database client, or cloud SDK call. Pattern matching can tell us that a value looks like a secret, but it can’t tell us whether the value is actually being used as one. The surrounding usage context helps the model distinguish real exposures from false alarms, such as random UUIDs or opaque strings, without reviewing the full file or repository.Focused context, not more dataIt’s natural to assume that improving accuracy requires analyzing more of the codebase. But the opposite is true.Most false positives can be resolved with focused, file-level context. What matters is not how much code the model sees, but whether it has the right signals.In many cases, you can determine whether a value is a real secret by looking at how it is used within a single file. Values that resemble placeholders, test data, or unused configuration can often be filtered out without deeper analysis.This keeps the system both effective and practical: high accuracy, low latency, and the ability to scale across large codebases.Results: reducing false positives in practiceWe evaluated this approach on hundreds of customer-confirmed false positive alerts.Our target was a 65% reduction. The result was 75.76%, exceeding that goal while maintaining strong detection performance.In practice, this means significantly less noise and a higher proportion of alerts that require action.False positive reduction results based on hundreds of customer-confirmed false positive alerts.This improvement shows up directly in the developer experience. With fewer irrelevant alerts, it becomes easier to trust what you see. Less time is spent triaging noise, and real issues can be prioritized and fixed faster.What’s nextWe’re continuing to evaluate this approach on larger datasets and live traffic, while improving how context is extracted and used for verification.Reducing false positives has been a consistent need at scale. This work focuses on improving signal quality where it matters most, making alerts easier to trust and act on.The goal is simple: fewer distractions, clearer signals, and faster action on real risks.Get started by running the risk assessment for your organization today, or learn more about secret scanning .Tags:

Secret ScanningWritten byMariko is a Principal Applied Scientist at Microsoft, where she leads the development of agentic AI workflows for cybersecurity operations. Her current interests focus on LLM-powered systems, agentic workflows, and applying frontier AI research to real-world products and operations.Related postsAI & ML

What are git worktrees, and why should I use them?Git worktrees have been around since 2015, but it wasn’t until recently they became popular. Learn what they are, how to use them, and why you might.AI & ML

GitHub Copilot CLI for Beginners: Overview of common slash commandsGitHub Copilot CLI for Beginners: Learn how to use slash commands to control your terminal AI agent.AI & ML

Accelerating researchers and developers building multilingual AI with a new open datasetA new repository-level dataset, published on GitHub under CC0-1.0, helps researchers and developers discover multilingual developer content across READMEs, issues, and pull requests.We do newsletters, tooDiscover tips, technical guides, and best practices in our biweekly newsletter just for devs.Your email address

Read Original Article →

Snapshots

Fetch evidence retained for parsing and audit.

200 · text/html; charset=UTF-8

06/17/2026, 10:57 AM

07a67d06f612c11807cc34d2eb3553e4a38d0dbba694d6a24bd890d57b8b680a

AI Outputs

Structured model outputs with validation status.

article.summarize

deepseek-v4-flash · valid

{"tags":["Secret Scanning","false positives","AI","LLM","security","GitHub","Microsoft","contextual reasoning","verification","developer trust"],"titleEn":"Making secret scanning more trustworthy: Reducing false positives at scale","titleZh":"让秘密扫描更可信：大规模减少误报","summaryEn":"GitHub...

article.classify

deepseek-v4-flash · valid

{"relevant":true,"confidence":0.9,"primaryTopic":"product-updates","secondaryTopics":["ai-engineering","agent-engineering"]}

Quality And Digest Links

Open and resolved issues, plus daily digest appearances.