DLP Lexicon
A comprehensive glossary of Data Security and AI Security terminology, concepts, and best practices covering modern DLP, insider risk management, agentic AI, and data governance. This content is for general educational purposes and is not specific to any particular vendor or product.
A
Access Control
Security technique that regulates who or what can view or use resources in a computing environment.
Account Takeover (ATO)
An attack where a threat actor gains unauthorised access to a legitimate user account to operate with insider-level privileges.
Activity Feed
A unified, time-sequenced stream of user and data events for investigation and forensic timelines.
Adversary Intelligence
Information about threat actors' tactics, techniques, and procedures used to inform defensive priorities.
Agent Governance
The framework of policies, controls, and oversight applied to autonomous AI agents.
Agent Identity
Treating AI agents as first-class identity principals with their own permissions and audit trails.
Agentic AI
AI systems capable of autonomous action, including planning tasks, calling tools, and making decisions without continuous human oversight.
Agentic Workflow
A multi-step process where AI agents coordinate tool calls and data access to accomplish complex tasks.
AI Acceptable Use Policy
Organisational policy defining approved AI tools, permitted use cases, and required controls for AI interaction.
AI Agent
A software entity powered by an AI model that performs tasks by chaining tool calls and reasoning over results.
AI Gateway
A security control point mediating interactions between users or applications and AI services.
AI Governance
The framework of policies, controls, and oversight governing how AI systems are deployed, monitored, and used.
Alert Fatigue
Reduced analyst effectiveness caused by excessive volumes of low-quality alerts from security systems.
Allow
A DLP policy decision that permits a data movement event to proceed, typically logged for audit purposes.
Anonymisation
Irreversible removal of identifying information so individuals cannot be re-identified, even by the data controller.
API Integration
Programmatic interfaces that enable DLP solutions to connect with and monitor third-party applications and services.
Audit-Grade Evidence
Forensic data produced to a standard sufficient for regulatory audits, legal proceedings, or formal investigations.
Automated Remediation
Automatic execution of corrective actions in response to detected policy violations.
Autonomous Agent
An AI agent that operates with minimal human supervision, making decisions across enterprise systems.
B
Behavioural Baseline
A learned profile of normal activity patterns used as the reference point for detecting anomalous behaviour.
Block
A DLP enforcement action that prevents a data movement event from completing.
Block Override
A DLP capability allowing users to override a block by providing a justification, captured as audit evidence.
Browser DLP
DLP controls applied within the browser session to monitor uploads, downloads, copy/paste, and AI prompt submissions.
Browser Extension
A plugin for standard browsers that adds security capabilities such as DLP or AI governance.
C
Case Management
Workflow capabilities for organising investigations, tracking evidence, and producing audit-grade output.
Chain of Custody
A documented, auditable trail of who handled data, when, and what actions were performed.
Clipboard Capture
Recording of clipboard content as forensic evidence for copy/paste-based data exfiltration investigations.
Closed-Loop Remediation
A remediation workflow that detects a violation, takes corrective action, and verifies the resolution.
Cloud Access Security Broker (CASB)
A security control point enforcing policies on user interactions with cloud applications.
Cloud DLP
DLP controls applied to data stored in or moving through cloud services via API or inline proxies.
Cloud Security Posture Management (CSPM)
Solutions that assess and remediate misconfigurations and compliance violations in cloud infrastructure.
Cloud-Native Application Protection Platform (CNAPP)
An integrated platform combining CSPM, CWPP, DSPM, and other cloud security capabilities.
Cloud-Native DLP
DLP solutions designed specifically for cloud environments and SaaS applications.
Compromised Insider
A legitimate user whose account or credentials have been taken over by an external attacker.
Connector
An integration linking a DLP platform to a specific data source or enforcement point.
Coverage
The breadth of channels, data types, and applications a DLP solution can monitor and enforce policy on.
Credential Theft
Acquisition of legitimate user credentials through phishing, malware, or social engineering.
Cross-Platform Data Protection
Unified data security providing consistent protection across different operating systems and platforms.
Customer-Managed Storage
An architectural model where forensic evidence is stored in customer-controlled infrastructure.
D
Data at Rest
Sensitive data stored in databases, file systems, or backups, the primary focus of DSPM and data discovery.
Data Classification
Categorising data based on sensitivity, regulatory requirements, or business value to inform protection decisions.
Data Control Plane
An architectural model separating DLP intelligence (truth and context) from distributed enforcement execution.
Data Discovery
Systematically identifying and cataloguing sensitive data across an organisation's systems.
Data Exfiltration
The unauthorised transfer of sensitive data from within an organisation to an external destination.
Data in Motion
Sensitive data being actively transmitted across networks or between applications.
Data in Use
Sensitive data actively being processed, edited, or interacted with by users or applications.
Data Lineage
Continuous tracking of data from origin through every transformation, copy, and egress for provenance and forensics.
Data Loss Prevention (DLP)
A security strategy and set of technologies to detect, monitor, and protect sensitive information from unauthorised exposure.
Data Loss Prevention as a Service (DLPaaS)
Cloud-delivered DLP capabilities without on-premises infrastructure.
Data Minimisation
The principle that organisations should collect and retain only the minimum data necessary for a specified purpose.
Data Movement Context
Contextual metadata captured at the moment data moves, including source, destination, user, and channel.
Data Origin
Identification of the source account, system, or application from which data originated.
Data Provenance
The historical record of where data originated and how it has been transformed or shared.
Data Residency
Requirements that data be stored or processed within specific geographic boundaries.
Data Risk Analytics
Advanced analysis and reporting that provides insights into data-related risks and compliance status.
Data Security Posture Management (DSPM)
Solutions that discover, classify, and assess the security posture of sensitive data across cloud and SaaS environments.
Data Sovereignty
The principle that data is subject to the laws of the country in which it is collected or stored.
Device Quarantine
Security response action that isolates compromised or high-risk devices from network access while preserving evidence.
Distributed Enforcement
Policy enforcement applied at multiple control points across the data path rather than a single chokepoint.
Document Fingerprinting
Creating a unique cryptographic signature of a document to identify exact or partial reuse after format changes.
E
Email DLP
DLP controls applied to outbound email to detect and prevent transmission of sensitive data.
Endpoint Detection and Response (EDR)
Security technology monitoring endpoints for malicious activity with forensic investigation and automated response.
Endpoint DLP
DLP controls at the user device level, monitoring local applications, removable media, clipboard, and file transfers.
Endpoint Protection Platform (EPP)
Security software on endpoints preventing malware and exploits through signature, behavioural, and ML detection.
Enforcement Layer
The execution component of a data control plane applying policy decisions in real time at distributed control points.
Enterprise Browser
A managed web browser with built-in security controls for SaaS access, DLP, and policy enforcement.
EU AI Act
The EU's regulatory framework for AI, classifying AI by risk level and imposing requirements on high-risk systems.
Exact Data Matching (EDM)
High-precision classification matching content against exact values to detect specific sensitive data with very low false positives.
Extended Detection and Response (XDR)
An evolution of EDR integrating telemetry from endpoints, networks, cloud, and identity into a unified platform.
F
False Positive
A DLP detection that incorrectly identifies legitimate activity as a policy violation.
File Shadow Copy
Forensic preservation of the actual file content involved in a DLP detection, stored separately for investigators.
Forensic Evidence
Records, artefacts, and contextual information collected during or after a security event for investigation.
G
GDPR
The EU's General Data Protection Regulation, the primary framework governing personal data processing in the EU and EEA.
GDPR Article 25
The GDPR provision requiring data protection by design and by default in data processing systems.
GenAI Prompt Capture
Collection of prompt content submitted to generative AI tools as forensic evidence for investigations.
GenAI Protection
Specialized security measures designed to protect sensitive data when using generative AI tools.
Generative AI (GenAI)
AI systems that produce new content (text, images, code, audio) based on patterns learned from training data.
I
Incident Response
Structured approach to handling and managing data security incidents and policy violations.
Indexed Document Matching (IDM)
Classification technique matching content against fingerprints of known sensitive documents to detect derivatives.
Indirect Prompt Injection
A prompt injection attack delivered through content an AI agent processes rather than direct user input.
Inline Enforcement
Real-time inspection and policy enforcement of data as it transits through a proxy, firewall, or browser session.
Insider Risk Management (IRM)
A security discipline combining behavioural analytics, data monitoring, case management, and investigation workflows.
Insider Threat
A security risk from individuals with authorised access, including negligent users, malicious insiders, and compromised accounts.
Insider Threat Management (ITM)
An alternative term for insider risk management focused on user activity monitoring and forensic investigation.
Intent Security
An emerging discipline detecting whether AI agent actions align with organisational policy intent.
Investigation Authorisation Workflow
A multi-level approval process required before analysts can access identifiable data during investigations.
Investigation Timeline
A time-sequenced reconstruction of user activity and data movement events for insider risk investigations.
ISO/IEC 42001:2023
The first international management system standard specifically for artificial intelligence governance.
J
Just-in-Time Coaching
User education delivered at the moment of a policy violation to modify behaviour without analyst workload.
Justification Capture
Recording of user-provided reasons for actions that triggered DLP policies, supporting investigations.
K
Keyword Matching
Basic detection method identifying sensitive content based on predefined keywords and phrases.
L
Large Language Model (LLM)
A generative AI type trained on large text corpora to produce human-like language output.
Last-Mile Enforcement
DLP controls at the user action layer where data leakage actually occurs, complementing upstream controls.
Lateral Movement
An attacker's progression through an organisation's systems after initial compromise.
Link Revocation
Automated removal of sharing permissions on files, particularly external sharing links, as a remediation action.
LLM-Based Classification
Data classification using large language models to understand semantic meaning and context of content.
M
Machine Learning
Advanced AI technique improving detection accuracy through automated learning from data patterns.
Malicious Insider
An individual with authorised access who deliberately exfiltrates data or harms the organisation.
MCP Gateway
A security control point between AI agents and MCP servers, inspecting tool calls and enforcing access policies.
MCP Server
A service implementing the Model Context Protocol to expose tools and data sources to AI agents.
Memory Poisoning
An attack introducing malicious data into an agent's persistent memory to alter its behaviour.
Microsoft Information Protection (MIP)
Microsoft's classification and labelling framework applying persistent sensitivity labels across Microsoft 365.
Microsoft Purview
Microsoft's unified data governance and compliance platform for DLP, IRM, Information Protection, and eDiscovery.
MITRE Insider Threat TTP
A standardised framework cataloguing tactics, techniques, and procedures observed in insider threat incidents.
Model Context Protocol (MCP)
An open standard for how AI systems access external tools, data sources, and services through a uniform interface.
Multi-Agent System
An architecture where multiple AI agents collaborate and coordinate to achieve goals beyond any single agent.
Multi-Tenant Architecture
Cloud infrastructure securely serving multiple organisations from a single software instance.
N
Negligent Insider
An employee or contractor causing a security incident through carelessness without malicious intent.
Network DLP
DLP controls at the network perimeter inspecting traffic for sensitive data exfiltration.
NIST AI Risk Management Framework
A US NIST framework providing voluntary guidance on managing risks associated with AI systems.
O
Optical Character Recognition (OCR)
Technology extracting text from images and scanned documents to enable content inspection by DLP engines.
Out-of-Band Enforcement
Asynchronous detection and remediation of policy violations after data movement, typically via API integration.
Output Redaction
Removing or masking sensitive information from AI-generated responses before delivery to users.
OWASP Top 10 for Agentic AI
OWASP's framework for the most critical security threats specific to autonomous AI agents.
OWASP Top 10 for LLMs
OWASP's catalogue of the most critical security risks for large language model applications.
P
Payment Card Information (PCI)
Credit card and payment data subject to PCI-DSS compliance requirements.
Personally Identifiable Information (PII)
Data that can identify a specific individual, a primary regulatory focus for DLP and privacy programmes.
Policy Engine
Core component evaluating content against defined rules and determining enforcement actions.
Policy Sprawl
The proliferation of DLP rules and policies leading to inconsistency, redundancy, and operational complexity.
Pre-Policy Visibility
Immediate visibility into data flows and user activity from agent deployment, before any DLP policies are configured.
Privacy by Design
An architectural principle requiring privacy protections to be embedded into systems from the outset.
Privilege Compromise
An agentic AI threat where an agent inherits or exploits user permissions for unauthorised operations.
Privilege Escalation
An attack technique increasing access levels within a system, from standard user to administrative privileges.
Prompt Injection
An attack embedding malicious instructions into AI inputs to manipulate behaviour or bypass safeguards.
Prompt Inspection
Real-time analysis of content submitted to AI services to detect sensitive data or malicious instructions.
Protected Health Information (PHI)
Health-related data covered by regulations such as HIPAA, including medical records and health data linked to individuals.
Pseudonymisation
A privacy-enhancing technique replacing identifying information with artificial identifiers, reversible by authorised parties.
Q
Quarantine
A remediation action moving a file to a restricted location pending review, preventing further sharing.
R
RBAC Pseudonymisation
Role-Based Access Control applied to pseudonymisation, where only elevated-privilege investigators see real identities.
Regular Expressions
Pattern matching language used to define complex search criteria for sensitive data detection.
Risk Score
A dynamic numeric or categorical rating assigned to a user, entity, or activity based on multiple risk signals.
Risk-Informed User Education
Targeted security training based on individual user risk profiles and behaviour patterns.
S
SaaS API Enforcement
DLP enforcement applied through SaaS platform APIs, enabling out-of-band detection and remediation.
SaaS Security Posture Management (SSPM)
Solutions monitoring and managing security configuration and posture of SaaS applications.
Sandboxing
Isolated environment for safely analyzing suspicious files and content without risk to production systems.
Screen Capture
A forensic artefact consisting of a screenshot at the moment of a policy violation for investigation context.
Secure Access Service Edge (SASE)
An architectural model combining SD-WAN networking with SSE security into a single cloud-delivered service.
Secure Data Flow
Data lineage technology tracking sensitive data from origin through manipulations and egress points.
Security Fabric
An integrated security architecture where multiple products share telemetry, intelligence, and policy context.
Security Information and Event Management (SIEM)
A platform collecting, correlating, and analysing security events to detect threats and support investigations.
Security Orchestration, Automation, and Response (SOAR)
A platform automating security operations workflows, integrating tools and orchestrating response actions.
Security Service Edge (SSE)
A cloud-delivered security category combining SWG, CASB, and ZTNA into a unified platform.
Semantic Classification
Classification interpreting the meaning and context of data rather than matching specific patterns or keywords.
Sensitivity Label
A persistent classification tag indicating data sensitivity level that travels with the data through sharing.
Shadow AI
Unauthorised or undisclosed use of AI tools by employees, creating data exposure and compliance risk.
Shadow GenAI
Unauthorised use of generative AI tools like ChatGPT, Gemini, and Claude outside organisational governance.
T
Ticket Factory
A pejorative term for security operations characterised by bulk-closing low-context alerts rather than active investigation.
Time-to-First-Signal (TTFS)
The elapsed time between deploying a DLP solution and producing the first actionable risk signal.
Tokenization
Data protection replacing sensitive data with non-sensitive placeholder tokens.
Tool Calling
The mechanism by which an AI agent invokes external functions, APIs, or services to perform actions.
Tool Misuse
An agentic AI threat where an agent is manipulated into using legitimate tools for malicious purposes.
Tool Poisoning
An attack maliciously modifying an MCP tool's description or behaviour to deceive an agent.
Trainable Classifier
A machine learning model trained on examples of sensitive data to identify similar content based on patterns.
True Positive
A DLP detection that correctly identifies an actual policy violation or risk event.
Truth Layer
The intelligence component of a data control plane establishing what data is sensitive and how it is used.
Tuning
The ongoing process of adjusting DLP policies and detection thresholds to balance accuracy against false positives.
U
User Activity Monitoring
Comprehensive tracking and analysis of user interactions with data and systems.
User and Entity Behaviour Analytics (UEBA)
Analytics establishing behavioural baselines for users and entities, detecting anomalous deviations indicating risk.
User Nudge
A real-time notification prompting users to reconsider an action or justify a violation, preserving user agency.
W
Warn
A DLP enforcement action displaying a notification about a potential violation but allowing the action to proceed.
Watermarking
Technique embedding identifying information into documents for tracking and attribution purposes.
Webhook
A mechanism for real-time event notifications between systems via HTTP callbacks.
Works Council
Employee representative bodies with legal rights to be consulted on workplace monitoring and DLP deployment.
Z
Zero Trust Architecture
Security model assuming no implicit trust, continuously verifying every transaction.
Zero Trust Network Access (ZTNA)
A security model granting access based on continuous verification of identity, device posture, and context.