tftsr-devops_investigation/src-tauri/src/pii/detector.rs
Shaun Arman 8839075805 feat: initial implementation of TFTSR IT Triage & RCA application
Implements Phases 1-8 of the TFTSR implementation plan.

Rust backend (Tauri 2.x, src-tauri/):
- Multi-provider AI: OpenAI-compatible, Anthropic, Gemini, Mistral, Ollama
- PII detection engine: 11 regex patterns with overlap resolution
- SQLCipher AES-256 encrypted database with 10 versioned migrations
- 28 Tauri IPC commands for triage, analysis, document, and system ops
- Ollama: hardware probe, model recommendations, pull/delete with events
- RCA and blameless post-mortem Markdown document generators
- PDF export via printpdf
- Audit log: SHA-256 hash of every external data send
- Integration stubs for Confluence, ServiceNow, Azure DevOps (v0.2)

Frontend (React 18 + TypeScript + Vite, src/):
- 9 pages: full triage workflow NewIssue→LogUpload→Triage→Resolution→RCA→Postmortem→History+Settings
- 7 components: ChatWindow, TriageProgress, PiiDiffViewer, DocEditor, HardwareReport, ModelSelector, UI primitives
- 3 Zustand stores: session, settings (persisted), history
- Type-safe tauriCommands.ts matching Rust backend types exactly
- 8 IT domain system prompts (Linux, Windows, Network, K8s, DB, Virt, HW, Obs)

DevOps:
- .woodpecker/test.yml: rustfmt, clippy, cargo test, tsc, vitest on every push
- .woodpecker/release.yml: linux/amd64 + linux/arm64 builds, Gogs release upload

Verified:
- cargo check: zero errors
- tsc --noEmit: zero errors
- vitest run: 13/13 unit tests passing

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-14 22:36:25 -05:00

102 lines
2.8 KiB
Rust

use crate::pii::patterns::get_patterns;
use crate::pii::{PiiSpan, PiiType};
use regex::Regex;
pub struct PiiDetector {
patterns: Vec<(PiiType, Regex)>,
}
impl PiiDetector {
pub fn new() -> Self {
PiiDetector {
patterns: get_patterns(),
}
}
pub fn detect(&self, text: &str) -> Vec<PiiSpan> {
let mut spans: Vec<PiiSpan> = Vec::new();
for (pii_type, regex) in &self.patterns {
for mat in regex.find_iter(text) {
spans.push(PiiSpan::new(
pii_type.clone(),
mat.start(),
mat.end(),
mat.as_str().to_string(),
));
}
}
// Sort by start position
spans.sort_by_key(|s| s.start);
// Remove overlapping spans (keep longer one)
let mut filtered: Vec<PiiSpan> = Vec::new();
for span in spans {
if let Some(last) = filtered.last() {
if span.start < last.end {
// Overlap: keep the longer one
if span.end - span.start > last.end - last.start {
filtered.pop();
filtered.push(span);
}
continue;
}
}
filtered.push(span);
}
filtered
}
}
impl Default for PiiDetector {
fn default() -> Self {
Self::new()
}
}
#[cfg(test)]
mod tests {
use super::*;
#[test]
fn test_detect_ipv4() {
let detector = PiiDetector::new();
let text = "Connected to 192.168.1.100 from 10.0.0.1";
let spans = detector.detect(text);
let ipv4_spans: Vec<_> = spans.iter().filter(|s| s.pii_type == "IPv4").collect();
assert!(!ipv4_spans.is_empty());
assert!(ipv4_spans.iter().any(|s| s.original == "192.168.1.100"));
}
#[test]
fn test_detect_email() {
let detector = PiiDetector::new();
let text = "User admin@example.com logged in";
let spans = detector.detect(text);
assert!(spans.iter().any(|s| s.pii_type == "Email"));
}
#[test]
fn test_detect_bearer_token() {
let detector = PiiDetector::new();
let text = "Authorization: Bearer eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.test";
let spans = detector.detect(text);
assert!(spans.iter().any(|s| s.pii_type == "Bearer"));
}
#[test]
fn test_no_overlap() {
let detector = PiiDetector::new();
let text = "IP: 192.168.1.1 user: test@test.com";
let spans = detector.detect(text);
// Verify no two spans overlap
for i in 0..spans.len() {
for j in (i + 1)..spans.len() {
assert!(spans[i].end <= spans[j].start, "Spans overlap!");
}
}
}
}