<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>pii detection &#8211; Blue Lance</title>
	<atom:link href="https://bluelance.com/docs-tag/pii-detection/feed/" rel="self" type="application/rss+xml" />
	<link>https://bluelance.com</link>
	<description></description>
	<lastBuildDate>Mon, 01 Jun 2026 18:46:48 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://bluelance.com/wp-content/uploads/2025/11/fevicon-ic-1.png</url>
	<title>pii detection &#8211; Blue Lance</title>
	<link>https://bluelance.com</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Configuring PII Detection Rules</title>
		<link>https://bluelance.com/docs/configuring-pii-detection-rules/</link>
		
		<dc:creator><![CDATA[peter thomas]]></dc:creator>
		<pubDate>Thu, 28 May 2026 16:21:57 +0000</pubDate>
				<guid isPermaLink="false">https://bluelance.com/?post_type=docs&#038;p=15885</guid>

					<description><![CDATA[PII detection rules define the patterns that PII Scanner uses to identify sensitive data in scanned files. Each rule consists of a regex pattern that is applied to file content during a scan — when a match is found, the result is forwarded in real time to LT Auditor MP. Configuring the right detection rules [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">PII detection rules define the patterns that PII Scanner uses to identify sensitive data in scanned files. Each rule consists of a regex pattern that is applied to file content during a scan — when a match is found, the result is forwarded in real time to LT Auditor <sup>MP</sup>. Configuring the right detection rules is critical to ensuring your scans are both thorough and accurate, minimizing both missed detections and false positives.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Understanding PII detection rules:</strong></p>



<p class="wp-block-paragraph">PII Scanner ships with a set of built-in detection rules covering the most common categories of sensitive data. These built-in rules can be enabled, disabled, or modified to suit your environment. Custom rules can also be added to detect organization-specific sensitive data types that are not covered by the defaults.</p>



<p class="wp-block-paragraph">Each detection rule consists of:</p>



<ul class="wp-block-list">
<li><strong>Name</strong> — a descriptive label for the PII class (e.g., Social Security Number)</li>



<li><strong>Regex Pattern</strong> — the regular expression used to identify matches in file content</li>



<li><strong>Enabled Status</strong> — whether the rule is active and applied during scans</li>



<li><strong>Severity Level</strong> — the importance of a match (Critical, High, Medium, Low)</li>
</ul>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Accessing PII detection rules:</strong></p>



<p class="wp-block-paragraph">Log in to the PII Scanner Server web UI at:<br>https://&lt;PII_Scanner_Server_IP&gt;:52766</p>



<ol class="wp-block-list">
<li></li>



<li>Navigate to <strong>Admin → PII Patterns</strong></li>



<li>The patterns list displays all configured detection rules with their name, pattern, enabled status, and severity level</li>
</ol>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Built-in PII detection rules:</strong></p>



<p class="wp-block-paragraph">PII Scanner includes the following default detection rules:</p>



<figure class="wp-block-table"><table class="has-fixed-layout"><tbody><tr><td><strong>PII Class</strong></td><td><strong>Description</strong></td><td><strong>Example Match</strong></td></tr><tr><td>Social Security Number</td><td>US SSN in common formats</td><td>123-45-6789, 123456789</td></tr><tr><td>Credit Card Number</td><td>Major card formats (Visa, Mastercard, Amex, Discover)</td><td>4111 1111 1111 1111</td></tr><tr><td>Email Address</td><td>Standard email format</td><td>user@domain.com</td></tr><tr><td>Phone Number</td><td>US and international formats</td><td>(555) 123-4567</td></tr><tr><td>Date of Birth</td><td>Common date formats</td><td>01/15/1980, 1980-01-15</td></tr><tr><td>Medical Record Number</td><td>Common MRN formats</td><td>Varies by healthcare system</td></tr><tr><td>IP Address</td><td>IPv4 address format</td><td>192.168.1.100</td></tr><tr><td>Passport Number</td><td>Common passport formats</td><td>Varies by country</td></tr></tbody></table></figure>



<p class="wp-block-paragraph"><em>[Your administrator should confirm which built-in rules are appropriate for your environment and compliance requirements, and disable any that generate excessive false positives.]</em></p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Enabling and disabling detection rules:</strong></p>



<p class="wp-block-paragraph">To enable or disable a built-in rule without deleting it:</p>



<ol class="wp-block-list">
<li>Navigate to <strong>Admin → PII Patterns</strong></li>



<li>Locate the rule in the patterns list</li>



<li>Click the <strong>Enabled</strong> toggle to turn the rule on or off</li>



<li>The change takes effect on the next scan job that runs</li>
</ol>



<p class="wp-block-paragraph">Disabled rules are not applied during scans but are retained in the system and can be re-enabled at any time. Prefer disabling over deleting built-in rules so they can be recovered if needed.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Editing an existing detection rule:</strong></p>



<p class="wp-block-paragraph">To modify the regex pattern or severity level of an existing rule:</p>



<ol class="wp-block-list">
<li>Navigate to <strong>Admin → PII Patterns</strong></li>



<li>Click the <strong>Edit</strong> icon next to the rule</li>



<li>Modify the relevant fields:
<ul class="wp-block-list">
<li><strong>Name</strong> — update if needed for clarity</li>



<li><strong>Regex Pattern</strong> — update the pattern to improve accuracy or reduce false positives</li>



<li><strong>Severity Level</strong> — adjust based on the sensitivity of the data type</li>
</ul>
</li>



<li>Click <strong>Save</strong></li>
</ol>



<p class="wp-block-paragraph">Test any modified regex patterns against sample data before activating them in a scan to confirm they match the intended data and do not produce excessive false positives.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Creating a custom detection rule:</strong></p>



<p class="wp-block-paragraph">Custom rules allow you to detect organization-specific sensitive data types not covered by the built-in patterns — such as employee ID numbers, internal account codes, or proprietary data formats.</p>



<ol class="wp-block-list">
<li>Navigate to <strong>Admin → PII Patterns</strong></li>



<li>Click <strong>Add Pattern</strong></li>



<li>Configure the custom rule:
<ul class="wp-block-list">
<li><strong>Name</strong> — a clear, descriptive name for the data type (e.g., Employee ID Number)</li>



<li><strong>Description</strong> — a brief explanation of what this pattern detects</li>



<li><strong>Regex Pattern</strong> — the regular expression to match the data type</li>



<li><strong>Severity Level</strong> — Critical, High, Medium, or Low based on data sensitivity</li>
</ul>
</li>



<li>Click <strong>Save</strong></li>
</ol>



<p class="wp-block-paragraph"><strong>Example custom patterns:</strong></p>



<figure class="wp-block-table"><table class="has-fixed-layout"><tbody><tr><td><strong>Data Type</strong></td><td><strong>Example Regex Pattern</strong></td></tr><tr><td>Employee ID (EMP + 6 digits)</td><td>EMP\d{6}</td></tr><tr><td>Internal account code</td><td>ACC-[A-Z]{2}-\d{4}</td></tr><tr><td>UK National Insurance Number</td><td>[A-Z]{2}\d{6}[A-Z]</td></tr><tr><td>Canadian SIN</td><td>\d{3}-\d{3}-\d{3}</td></tr><tr><td>Australian Tax File Number</td><td>\d{3}\s\d{3}\s\d{3}</td></tr></tbody></table></figure>



<p class="wp-block-paragraph"><em>[Your administrator should work with your legal and compliance teams to identify any organization-specific data types that require custom detection rules.]</em></p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Writing effective regex patterns:</strong></p>



<p class="wp-block-paragraph">When creating or modifying detection rules, keep the following in mind:</p>



<p class="wp-block-paragraph"><strong>Be specific enough to avoid false positives:</strong> A pattern that is too broad will match unintended content and generate noise in your scan results. For example, a simple \d{9} pattern would match any 9-digit number, not just Social Security Numbers.</p>



<p class="wp-block-paragraph"><strong>Be flexible enough to catch real matches:</strong> Data is not always formatted consistently. SSNs may appear with or without dashes. Phone numbers may use spaces, dots, or dashes as separators. Build flexibility into patterns where appropriate:</p>



<p class="wp-block-paragraph"># SSN — matches with or without dashes</p>



<pre class="wp-block-code"><code>\b\d{3}&#91;-\s]?\d{2}&#91;-\s]?\d{4}\b</code></pre>



<p class="wp-block-paragraph"># Phone — matches multiple separator styles</p>



<pre class="wp-block-code"><code>\b(\+1&#91;-\s]?)?\(?\d{3}\)?&#91;-\s.]?\d{3}&#91;-\s.]?\d{4}\b</code></pre>



<p class="wp-block-paragraph"><strong>Use word boundaries:</strong> Add \b (word boundary) anchors to prevent partial matches within longer strings:</p>



<p class="wp-block-paragraph"># Without boundary — matches &#8220;123456789&#8221; inside &#8220;9123456789&#8221;</p>



<pre class="wp-block-code"><code>\d{9}</code></pre>



<p class="wp-block-paragraph"># With boundary — only matches standalone 9-digit numbers</p>



<pre class="wp-block-code"><code>\b\d{9}\b</code></pre>



<p class="wp-block-paragraph"><strong>Test patterns before activating:</strong> Use an online regex tester with representative sample data from your environment to validate patterns before adding them to PII Scanner.</p>



<p class="wp-block-paragraph"><em>[Your administrator should involve your security or data governance team when writing custom regex patterns to ensure accuracy and compliance alignment.]</em></p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Managing detection rule severity levels:</strong></p>



<p class="wp-block-paragraph">Severity levels help prioritize scan results in LT Auditor <sup>MP</sup> and can be used to drive alert rules and compliance reporting. Assign severity levels based on the regulatory and business impact of each data type:</p>



<figure class="wp-block-table"><table class="has-fixed-layout"><tbody><tr><td><strong>Severity</strong></td><td><strong>Examples</strong></td></tr><tr><td>Critical</td><td>SSNs, credit card numbers, medical record numbers, passport numbers</td></tr><tr><td>High</td><td>Email addresses combined with other PII, dates of birth, financial account numbers</td></tr><tr><td>Medium</td><td>Phone numbers, IP addresses, employee IDs</td></tr><tr><td>Low</td><td>Internal codes, reference numbers with limited sensitivity</td></tr></tbody></table></figure>



<p class="wp-block-paragraph"><em>[Your administrator should define severity levels in alignment with your organization&#8217;s data classification policy.]</em></p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Reviewing detection rule effectiveness:</strong></p>



<p class="wp-block-paragraph">After running scan jobs, review the results in LT Auditor <sup>MP</sup> to assess whether your detection rules are performing as expected:</p>



<ol class="wp-block-list">
<li>Navigate to <strong>View</strong> in the LT Auditor <sup>MP</sup> Web UI</li>



<li>Filter by <strong>Source — PII Scanner</strong></li>



<li>Review the PII classes detected across recent scans</li>



<li>Identify:
<ul class="wp-block-list">
<li><strong>High false positive rates</strong> — rules generating many matches that are not actually sensitive data — consider tightening the regex pattern or disabling the rule</li>



<li><strong>Missed detections</strong> — known sensitive data that is not being detected — review and update the relevant regex pattern</li>



<li><strong>Unexpected findings</strong> — sensitive data found in unexpected locations — flag for remediation and access control review</li>
</ul>
</li>
</ol>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><strong>Best practices:</strong></p>



<ul class="wp-block-list">
<li>Review and validate all built-in detection rules before running your first scan to confirm they are appropriate for your environment</li>



<li>Disable built-in rules that consistently generate false positives in your environment rather than tolerating the noise</li>



<li>Test all custom regex patterns thoroughly with real sample data before activating them</li>



<li>Assign severity levels consistently across all rules to ensure reliable prioritization in LT Auditor <sup>MP</sup></li>



<li>Review detection rules regularly — data types and formats used in your organization may change over time</li>



<li>Document the purpose and expected output of each custom rule so other administrators can maintain them</li>



<li>Involve your legal and compliance teams when defining rules for regulated data types to ensure alignment with your compliance obligations</li>
</ul>



<p class="wp-block-paragraph"><em>[Your administrator should schedule a periodic review of all active detection rules — at minimum annually, or whenever compliance requirements or data handling practices change in your organization.]</em></p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
