ADVERSARIAL VALIDATION PROCEDURE
Pen Test · Red Team · Purple Team · Cloud Attack Sim · App Test · OT Safety · Finding-to-Closure
| Document ID | CERG-PRC-AV-001 |
| Version | 1.2 |
| Status | Approved |
| Classification | Public |
| Owner | Adversarial Testing Lead |
| Parent Policy | CERG-POL-001 - Cybersecurity Policy |
| Supporting Documents | CERG-GOV-CB-001 · CERG-STD-LM-001 · CERG-STD-OT-001 · CERG-STD-CUI-001 · CERG-PRC-VM-001 · CERG-PRC-RM-001 · CERG-PRC-AR-001 · CERG-PRC-TPRM-001 · CERG-PLN-IR-001 · CERG-GOV-CEF-001 · CERG-GOV-IMPREG-001 · CERG-PRC-LL-001 |
| Review Cycle | Annual / On material program change |
| Frameworks | NIST 800-53r5 (CA-8) · NIST 800-115 · PTES · MITRE ATT&CK · MITRE D3FEND · OWASP WSTG / MASTG |
| Regulations | NERC-CIP CIP-007 (vulnerability assessment) · CMMC L2 (3.11.2) · SOX (indirect via control testing) |
| Environments | All - IT · cloud · SaaS · OT (with safety constraints) · application · identity |
Table of Contents
- Purpose and Scope
- Engagement Types
- Annual Cadence and Scoping
- Rules of Engagement (RoE) Template
- Red-Team Engagement Charter
- Purple-Team Detection Validation
- Cloud Attack Simulation
- Application Penetration Test Checklist
- OT Adversarial Assessment, Safety Protocol
- Finding Rating, Routing, and Retest
- Systemic Analysis and Program Feedback Loop
- Roles and Responsibilities
- Evidence Retention
- Regulatory and Framework Alignment Summary
- Document Control
1. Purpose and Scope
The Cybersecurity Policy requires periodic adversarial testing. The exposure management procedure names the adversarial activities CERG performs. The risk taxonomy distinguishes adversarial testing from vulnerability scanning. Until this procedure, the operating model, scope, rules, safety, finding routing, evidence, for adversarial validation was implicit.
This procedure makes it explicit. It applies to every CERG-led or CERG-overseen adversarial engagement: external pen test, internal pen test, cloud red-team / attack simulation, web/mobile/API pen test, OT adversarial assessment, purple-team detection validation, and social-engineering work conducted with the adjacent Awareness function.
In CERG, penetration testing is one engagement type within the broader adversarial validation capability. It does not replace red-team operations, purple-team detection validation, cloud attack simulation, OT-safe assessment, or other authorized testing where those activities are in scope.
Adversarial Validation Is the Antidote to Drift
A control library that exists on paper but has never been tested is a hope. Adversarial validation tests the assumption that the control library implements actual defense. CERG runs it on a cadence; documents the rules; routes findings into exposure management or the risk register; and uses purple-team work to upgrade detection, not just to score the team.
2. Engagement Types
| Type | Goal | Default Frequency | Adversary Model |
|---|---|---|---|
| External Pen Test | Validate the perimeter - internet-facing surface, edge identity, public services. | Annual | Unauthenticated external attacker. |
| Internal Pen Test | Validate post-foothold lateral movement, privilege escalation, segmentation. | Annual | Authenticated low-privilege user / compromised workstation. |
| Cloud Red Team / Attack Sim | Validate cloud control plane, identity, KMS, logging detection. | Annual + on material cloud change | Compromised cloud identity / supply-chain attack on cloud workloads. |
| Application Pen Test | Validate web / mobile / API surface of in-scope applications. | Annual for T1/T2 apps; risk-based for others; pre-launch for new apps. | Authenticated and unauthenticated. |
| OT Adversarial Assessment | Validate OT boundary, ESP/EAP, jump server, OT detection set. | Per CIP-007 cadence (at least every 15 months for BES) | Compromised IT-side identity attempting OT pivot; vendor-channel compromise. |
| Purple-Team Detection Validation | Validate the day-one + tuned detection set fires as designed. | Quarterly | ATT&CK technique emulation. |
| Social Engineering | Validate identity and awareness controls; phishing-resistance. | Per Awareness program | External commodity + targeted scenarios. |
| Tabletop / Crisis Simulation | Validate IR plan + CERG interfaces during a simulated incident. | Annual | Scenario-driven. |
2.1 External Tester Management
When adversarial validation is conducted by an external firm rather than internal staff, the following requirements apply.
Minimum Tester Qualifications
| Engagement Type | Minimum Qualification |
|---|---|
| Network / infrastructure penetration test | OSCP, GPEN, or equivalent; 2+ years of hands-on testing experience |
| Web application penetration test | OSWE, GWAPT, or equivalent; demonstrated web app testing portfolio |
| Red team engagement | GXPN, OSEP, or equivalent; 5+ years of adversarial emulation experience |
| Cloud security assessment | Cloud-specific certification (AWS Security, Azure Security, GCP Professional Security) or equivalent |
| OT / ICS assessment | GICSP, GRID, or equivalent; operational technology experience with safety-aware methodology |
| Social engineering engagement | SE-specific training or certification; understanding of legal and ethical boundaries |
Selection and Vetting Requirements
| Requirement | Detail |
|---|---|
| Conflict of interest disclosure | Firm must disclose any prior or current relationship with the organization’s competitors, vendors, or key personnel that could compromise objectivity |
| Background checks | All testers assigned to the engagement must have passed a background check within the last 12 months; evidence of clearance retained by the Vendor Risk Analyst |
| Insurance | Firm must carry professional liability / errors and omissions insurance appropriate to the engagement scope; certificate of insurance on file |
| References | At least two references from engagements of comparable scope and sensitivity in the last 24 months |
Vendor Rotation Policy
- External testing vendors are rotated at minimum every 3 years for each scope area (network, application, cloud, OT, social engineering).
- Rotation applies to the firm, not individual testers.
- Exceptions require CISO approval and are documented in the risk register.
Contractual Requirements
- Non-disclosure agreement (NDA) covering all findings, data accessed, and methods used
- Data handling: all customer data accessed during testing is treated as Restricted; no data leaves the organization’s control without explicit authorization
- Evidence destruction: upon engagement closure or within 30 days, the firm certifies destruction of all engagement data, findings drafts, and tool output in their possession
- Right-to-audit: the organization reserves the right to audit the firm’s handling of engagement data
- Notification of subcontractors: the firm must disclose any subcontractors and their qualifications before engagement start
3. Annual Cadence and Scoping
CERG produces an annual Adversarial Validation Plan that schedules the engagements above against the in-scope environments. The plan is reviewed and approved by the CISO; it is updated quarterly to reflect emerging threats and program priorities.
Scoping inputs:
- Highest-residual-score risks in the register.
- Recently introduced systems and architectures (
CERG-PRC-AR-001Phase 5 records). - Regulatory cadence requirements (CIP, CMMC).
- ATT&CK coverage gaps identified by detection engineering.
- Threat intelligence pointing at adversary activity relevant to the organization.
4. Rules of Engagement (RoE) Template
Every engagement has a signed RoE before any activity begins. The template:
RULES OF ENGAGEMENT - <Engagement Name> AV-YYYY-NNNN
1. ENGAGEMENT IDENTITY
Type : (External / Internal / Cloud / App / OT / Purple / SE / TT)
Sponsor (named role) :
Test Lead :
Period :
Time Windows :
2. OBJECTIVES
What we are testing :
Success criteria :
Risk-register / control gaps this engagement is intended to validate
3. SCOPE - IN
IPs / URLs / Apps / Cloud accounts / OT segments / Identity tenants explicitly in scope
4. SCOPE - OUT
Explicit out-of-scope: e.g., production-impacting denial-of-service, mass account lockouts, third-party-hosted out-of-scope assets
5. PROHIBITED ACTIONS
(Defaults below; engagement may add. None may be removed.)
- No denial-of-service against production
- No data destruction or modification
- No persistence beyond engagement window
- No exfiltration of regulated data (CUI / BCSI / PHI / PII); proof-of-concept counts; data is staged and discarded
- No exploitation of safety-impacting OT systems (Section 9)
- No social engineering of named C-suite without explicit prior approval
- No physical intrusion outside agreed scope
6. SAFETY AND ABORT
Stop conditions : (defined; e.g., OT process alarms, anomalous business impact)
Abort signal : (named individual + backup; mechanism - phone / Signal channel)
Operator on-call (OT) : (if OT)
Incident handoff path : if findings cross the "test" → "real incident" threshold
7. AUTHORIZATION
Authorization letter signed by :
Distribution :
Authority to act in our environment
8. COMMUNICATIONS
SOC informed? : Yes - passive (no detection assistance) / Yes - active (purple)
Liaison on Tester side:
Liaison on CERG side :
Reporting cadence : (daily standups / weekly status / final report)
9. EVIDENCE COLLECTION
What is captured :
How it's stored :
What is sanitized in final report
10. CONFIDENTIALITY
Findings classification (default: Restricted)
Handling : encrypted at rest, named distribution, no email of unredacted
11. SIGNATURES
Tester Lead :
CERG Sponsor :
System / Business Owner :
CISO (red team / OT / cloud red team) :
5. Red-Team Engagement Charter
Red-team engagements are scope-broader, time-longer, goal-driven simulations of a realistic adversary. Each red team has a Charter in addition to its RoE.
RED-TEAM CHARTER - <Engagement Name> AV-YYYY-NNNN
A. ADVERSARY MODEL
Threat actor profile :
Capability : (commodity / advanced / nation-state-equivalent)
Resourcing (time, tools, ops infrastructure)
B. OBJECTIVES - IN PRIORITY ORDER
Crown Jewel(s) to attempt to reach
Detection escape required? :
Persistence required? :
Specific TTPs to emulate : (ATT&CK technique IDs)
C. ENGAGEMENT WINDOW
Start / End :
No-Op windows : (financial close, regulatory exam, major event)
D. PRE-AGREED LIMITS
Production-impact threshold :
OT touch : (forbidden unless explicit)
E. WHITE-CELL
Named coordinators on both sides; channels for clarification without breaking the test
6. Purple-Team Detection Validation
Purple work is the collaboration between offensive and detection engineering. It is not a graded exam of the SOC; it is a measurement of detection coverage and the basis for upgrading rules and baselines.
6.1 Cadence and Coverage
- Quarterly purple-team cycle covering a rotating slice of the ATT&CK matrix in scope.
- Annual full-scope purple-team across the IT spine; semi-annual on cloud sub-matrix; annual on OT sub-matrix.
- Each cycle has 8–20 techniques tested; the cycle’s selection is risk-driven (recent threat intel, recent infrastructure changes, prior-cycle failures).
6.2 Validation Workflow
- Plan, choose techniques (ATT&CK IDs); map to expected detections in the SIEM; document expected alert.
- Execute, emulate technique in non-production where possible; in production with controlled, documented action where required.
- Observe, did the expected detection fire? With expected priority? In expected time?
- Classify, Pass · Pass-with-Latency · Pass-with-Tuning-Needed · Fail (no detection) · Fail-Suppressed (existed but suppressed).
- Action, Pass: no change. Pass-with-Tuning: tune. Fail / Fail-Suppressed: open detection backlog item + risk register entry until restored.
- Reflect, feed result into detection inventory; update DT-002 metric in
CERG-GOV-MTR-001.
6.3 Purple-Team Outcome to Baseline
The Purple Cycle Updates Detection, Not Just the Scorecard
If a technique fires nine times out of ten because logs were missing for one source, the fix is to fix the source, not to re-grade the test. If a technique fires reliably but the alert routes to the wrong queue, the fix is the queue. Purple cycles end with a list of concrete changes to the detection set; if they don’t, the cycle is incomplete.
7. Cloud Attack Simulation
Cloud engagements emulate adversary activity at the cloud control plane and identity layer, historically the most-attacked surface in modern enterprises.
| Focus | Examples |
|---|---|
| Identity | OAuth consent grant abuse; consent phishing; token theft; federation tampering. |
| Cloud control plane | IAM privilege enumeration and escalation; CloudTrail / Activity Log disable attempts; key policy changes; cross-account assumption. |
| KMS / Secrets | Unauthorized key access; envelope-encryption bypass attempts. |
| Workloads | Container escape; metadata service abuse; misconfigured serverless. |
| Logging / monitoring | Visibility evasion - disable, alter, route around. |
Cloud engagements explicitly include detection validation; CSPM/SSPM signals and cloud-native detection (GuardDuty, Defender for Cloud, Security Command Center) are exercised.
8. Application Penetration Test Checklist
Application tests use OWASP WSTG (web), MASTG (mobile), and API Security Top 10 as the technical reference. The checklist below is the minimum coverage at scoping; specific apps may need more.
| Domain | Coverage |
|---|---|
| Authentication | MFA enforcement, credential stuffing resistance, password-reset flow, session handling. |
| Authorization | IDOR, function-level access, multi-tenant isolation, RBAC enforcement. |
| Input handling | Injection (SQL, command, template), deserialization, file upload, SSRF. |
| Output / data exposure | Information disclosure, sensitive data in responses, error verbosity. |
| Configuration | Headers (HSTS, CSP, etc.), TLS conformance to CERG-STD-CR-001, exposed admin endpoints. |
| Business logic | Workflow bypass, race conditions, abuse of intended functionality. |
| API | OWASP API Top 10: object-level, function-level, mass assignment, rate limiting, etc. |
| Mobile | OWASP MASTG: storage, transport, platform interaction, code quality, resilience. |
8.1 Social Engineering Engagements
Social engineering tests the human layer of security controls. These engagements carry unique legal, ethical, and operational risks that require explicit guardrails.
Scope Definition
| Engagement Subtype | Description | Typical Scope |
|---|---|---|
| Phishing simulation | Simulated phishing email campaign to test user awareness and reporting behavior | Organization-wide or targeted departments |
| Vishing (voice phishing) | Simulated phone-based social engineering to extract information or credentials | Named individuals or departments with CISO approval |
| Smishing (SMS phishing) | Simulated text message-based social engineering | Named individuals or departments with CISO approval |
| Physical social engineering | Tailgating, badge cloning, or pretext-based physical access attempts | Named facilities with Facility Security coordination |
| USB drop | Simulated malicious USB devices placed in common areas | Named facilities with Facility Security coordination |
Pretexting Rules and Limitations
- Pretexts must not impersonate law enforcement, government officials, or regulatory authorities.
- Pretexts must not exploit personal tragedy, medical conditions, or family emergencies.
- Pretexts must not target minors or non-employees.
- Pretexts must not threaten employment status, compensation, or benefits.
- All pretexts are defined in the RoE and approved by the Engagement Sponsor.
Prohibited Targets (require explicit CISO approval)
- C-suite executives and their direct administrative staff
- Board members
- Legal counsel (internal and external)
- Human Resources personnel
- Personnel on leave (medical, family, or other protected leave)
Coordination Protocol
| Stakeholder | When | Purpose |
|---|---|---|
| Legal | Before RoE approval | Review pretexts for legal risk; confirm no violation of wiretapping, recording, or privacy laws |
| HR | Before RoE approval | Confirm no employee relations risk |
| Facility Security | Before physical or USB engagements | Coordinate access testing; prevent alarm triggers or law enforcement response |
| SOC / Detection Team | Before go-live | Prevent the engagement from being treated as a real incident |
| Incident Commander | Before go-live | Pre-brief: if engagement triggers detection, the IC knows it’s a test |
Reporting Requirements
Social engineering findings are reported separately from technical findings. The report includes:
- Engagement subtype and scope
- Number of targets and interactions
- Success rate per subtype and per department
- Root cause themes (e.g., lack of verification procedure, urgency exploitation, authority bias)
- Recommended awareness and control improvements
- Comparison to prior social engineering engagements (if applicable)
9. OT Adversarial Assessment: Safety Protocol
OT testing is uniquely constrained. Safety overrides any test objective.
9.1 Prohibited Without Explicit Approval
The following are prohibited absent a documented exception with engineering and operations sign-off:
- Active scanning of live SCADA/HMI/RTU/relay surfaces. Passive monitoring and engineering-supervised authenticated checks only.
- Exploitation that could affect setpoints, alarms, relays, breakers, or control logic.
- Denial-of-service against any OT segment, including saturation testing.
- Disabling of OT monitoring or logging.
- Persistence on OT systems.
- Engagement during operational events: storms, demand peaks, planned outages, regulatory windows.
9.2 Permitted Approaches
- IT-side validation of paths into OT (jump server, vendor remote access, IT/OT crossing).
- Lab / digital-twin replication of OT systems for offensive testing.
- Passive observation and documentation of OT exposure.
- Engineering-supervised authenticated checks in a defined window.
- Tabletop / scenario-based assessment with OT operators.
9.3 Authority and Safety Officer
Every OT engagement has a named OT Safety Officer (an operator with substation / process engineering authority) who can stop the test for any reason without justification. The Safety Officer’s “stop” is binding.
The Cost of Getting OT Testing Wrong
A pen-test misstep on an enterprise web app produces a ticket. A misstep on a relay setting group can shed load or trip a substation. CERG conducts OT adversarial work with the same care a power engineer applies in the field, and structures the engagement so the engineer’s call always wins.
10. Finding Rating, Routing, and Retest
10.1 Rating
Findings are rated using a uniform schema regardless of engagement type, aligning to CERG-PRC-VM-001 severity bands so they can co-route into exposure management:
| Severity | Definition |
|---|---|
| Critical | Direct path to Crown Jewel, regulated data, safety impact, or persistent foothold. |
| High | Significant escalation or material exposure of sensitive data; widespread bypass of a control. |
| Medium | Notable control gap with limited exploitability or limited scope. |
| Low | Hygiene or hardening recommendation. |
| Informational | Observations and recommended improvements not tied to a control gap. |
10.2 Routing
| Source of Finding | Routes To |
|---|---|
| Specific software / asset vulnerability | CERG-PRC-VM-001 finding + risk register entry per CERG-PRC-RM-001 if Critical/High |
| Control / design weakness | Risk register entry directly per CERG-PRC-RM-001 |
| Detection gap | Detection backlog + risk register if material |
| Process / runbook weakness | Owning procedure (AR / IR / TPRM) + risk register if material |
| Vendor / supply chain weakness | TPRM record per CERG-PRC-TPRM-001 |
| OT-specific weakness | OT exposure management workflow per CERG-PRC-VM-001 §4.3 and §6.3 + risk register |
10.3 Retest and Closure
- Critical findings retested within 30 days of remediation claim.
- High findings retested within 60 days.
- Medium/Low retested at next routine engagement or by sampled VM scan.
- A finding moves to Closed only after retest demonstrates remediation; a finding that cannot be remediated moves through the risk register acceptance path with explicit Approver.
- Closed findings inform the threat model for the affected system at next architecture review.
10.4 Escalation for Stalled Remediation
When remediation stalls beyond the retest window, the finding escalates.
Escalation Triggers
| Trigger | Action |
|---|---|
| Critical finding not remediated at 30 days | Adversarial Testing Lead escalates to Risk Pillar Leader |
| Critical finding not remediated at 45 days | Risk Pillar Leader escalates to CISO |
| Critical finding not remediated at 60 days | CISO escalates to Executive; risk acceptance required |
| High finding not remediated at 60 days | Adversarial Testing Lead escalates to Risk Pillar Leader |
| High finding not remediated at 90 days | Risk Pillar Leader escalates to CISO |
| Medium finding not remediated at 120 days | Adversarial Testing Lead escalates to system owner’s pillar leader |
Escalation Ladder
Adversarial Testing Lead → Risk Pillar Leader → CISO → Executive Sponsor
At each level, the responsible party must either (a) produce a credible remediation plan with a committed date, (b) initiate a risk acceptance through CERG-PRC-RM-001, or (c) provide a written rationale. Non-response at any level automatically escalates after 5 business days.
Risk Acceptance as the Only Alternative
When remediation is not feasible, risk acceptance requires: a written risk statement per PRC-RM-001 §4.1, documented compensating controls, approval per the authority matrix, and a scheduled re-review date. A finding that is neither remediated nor risk-accepted is a program gap, not an operational delay.
10.5 Adversarial Validation Final Report
Every engagement produces a Final Report - the authoritative record of what was tested, found, and recommended.
Required Sections
| Section | Content |
|---|---|
| Executive Summary | Engagement type, scope, dates; findings summary by severity; top three risks; overall posture assessment; recommended next actions |
| Engagement Overview | RoE reference; scope (systems, networks, applications); out-of-scope items; timeline; tester(s); methodology summary |
| Methodology | Reconnaissance and enumeration techniques; exploitation approach; post-exploitation and lateral movement; tools and frameworks used |
| Findings | Each finding includes ID, severity, title, description, affected assets, evidence (annotated screenshots, command output), risk statement, remediation recommendation, references (CVE, ATT&CK, CWE) |
| Systemic Analysis | Patterns across findings; control themes that failed; root cause candidates |
| Trend Analysis (if applicable) | Comparison to prior engagements; findings closed since last engagement; new vs. recurring findings; trend direction |
| Appendices | Evidence artifacts, exploit code references (sanitized), timeline of key events, tools inventory |
Evidence Artifact Standards
- Screenshots: Annotated with arrows or captions highlighting relevant detail; include timestamp and host identifier
- Command output: Full command and output with timestamp; truncation acceptable with notation
- Packet captures: Time-stamped with source/destination; filtered to relevant conversation
- Exploit code: Referenced by tool and version, not reproduced in full
10.6 Inter-Rater Reliability for Finding Ratings
Different testers may rate the same finding differently. The following process ensures severity ratings are consistent and defensible.
Peer Review for High-Severity Findings
All Critical and High findings receive a peer review by a second qualified tester before the finding is finalized. The peer reviewer validates: - The severity rating against the definitions in Section 10.1 - That the evidence supports the severity determination - That the finding is not a duplicate of a known issue
If the peer reviewer disagrees with the severity, the Adversarial Testing Lead facilitates resolution. The final severity is the consensus rating, or the higher of the two if consensus cannot be reached.
Calibration Sessions
Quarterly calibration sessions are conducted using a sample of anonymized findings from recent engagements. All testers independently rate each finding. Results are compared and discussed. The purpose is to: - Align severity interpretation across testers - Identify systematic over-rating or under-rating patterns - Refine severity definitions where ambiguity is found
The Adversarial Testing Lead facilitates calibration and documents outcomes. Calibration results inform tester development and, where appropriate, updates to this procedure’s severity definitions.
Dispute Resolution
| Dispute | Resolution |
|---|---|
| Tester and peer reviewer disagree on severity | Adversarial Testing Lead facilitates; default to higher severity if unresolved |
| Tester and system owner disagree on severity | Risk Pillar Leader reviews; severity determined by risk to the organization, not by remediation difficulty |
| Tester and CISO disagree on Critical classification | CISO makes final determination; rationale documented |
11. Systemic Analysis and Program Feedback Loop
An Adaptive program does not stop at fixing individual pen-test findings. It asks: why did four of six DMZ servers share the same misconfiguration? What standard or procedure failed to prevent this? How do we change the program so the next engagement does not repeat the same class of finding?
This section defines the post-engagement systemic analysis that converts adversarial validation findings into program improvements.
11.1 Post-Engagement Systemic Analysis
After every adversarial validation engagement, the Adversarial Testing Lead produces two outputs: a findings report (existing) and a systemic analysis (new). The systemic analysis answers four questions:
- Which findings share a common root cause? Example: four findings relate to default credentials on newly provisioned systems : the provisioning process does not enforce credential rotation. This is not four separate problems; it is one systemic weakness manifesting in four locations.
- Which findings indicate a standard or procedure gap? Example: DMZ web servers all had the same TLS misconfiguration : STD-CFG-001 or STD-NET-001 does not specify DMZ TLS requirements clearly enough.
- Which findings indicate a control effectiveness gap? Example: WAF was deployed per CB-001 but three bypass techniques worked : the WAF control is Implemented but not effective (per CEF-001).
- Which findings indicate a detection gap? Example: testers pivoted laterally for four hours without an alert : detection rules need updating for the observed technique chain.
11.2 Systemic Finding Routing
Each systemic finding is routed to the appropriate owner for program-level action:
| Systemic Finding Type | Routed To | Tracking Mechanism |
|---|---|---|
| Standard or procedure gap | Governance Pillar Leader (Policy & Standards Manager) | IMPREG-001 (Type: Standard revision or Procedure update) |
| Control effectiveness gap | Accountable pillar leader (per CB-001) | IMPREG-001 (Type: Control amendment) + CEF-001 control failure analysis |
| Detection gap | Risk Pillar Leader (Detection Engineer) | Detection backlog per LM-001 + IMPREG-001 if systemic |
| Process gap | Accountable procedure owner (per RAC-001) | IMPREG-001 (Type: Procedure update) |
| Tooling or capability gap | Relevant pillar leader | IMPREG-001 (Type: New capability) + CISO if budget required |
Routed systemic findings are tracked in the improvement register (IMPREG-001) per the improvement lifecycle. The source field records the engagement ID and date.
11.3 Prevention Verification
At the next adversarial validation engagement of the same type, scope, or target environment, the engagement plan includes re-testing the systemic weaknesses identified in the prior engagement.
- If the same class of finding does NOT recur: the systemic improvement is verified Effective in IMPREG-001.
- If the same class of finding recurs: the improvement is marked Ineffective or Partially Effective, re-opened in IMPREG-001, and the root cause analysis is re-examined. The fact that a deliberate program change did not prevent recurrence is itself a critical lesson.
The Recurrence Test
An assessor reviewing two consecutive pen test reports should see: (a) the specific vulnerabilities from the prior test are closed, AND (b) the systemic weakness that produced them no longer produces new findings of the same class. If the second report finds different specific vulnerabilities but the same systemic root cause : different default credentials on different systems : the improvement failed. The program that only fixes individual findings and never the systemic cause is operating at Repeatable, not Adaptive.
11.4 Annual Trend Analysis
Annually, the Adversarial Testing Lead produces an adversarial validation trend analysis covering all engagements in the trailing 12 months:
- Are total findings increasing or decreasing? Segmented by severity.
- Are systemic weaknesses repeating across engagements?
- Which control families (per CB-001) produce the most findings? The most Critical and High findings?
- Which control families show improvement (fewer findings year-over-year)?
- How many systemic findings were routed to IMPREG-001, and what was their verification outcome?
The trend analysis feeds into: - The control effectiveness program (CEF-001): control families with persistently high finding rates trigger a control failure analysis - The risk appetite calibration (RMF-001): a finding-rate trend that undermines a risk appetite assumption triggers recalibration - The annual maturity assessment (MAT-001): finding-rate trends are evidence of control effectiveness or ineffectiveness
12. Roles and Responsibilities
| Role | Responsibility |
|---|---|
| Adversarial Testing Lead | Owns this procedure. Produces the annual plan, engagement RoE/Charter, findings report, and systemic analysis. Routes systemic findings. Produces the annual trend analysis. |
| Risk Pillar Leader | Accountable for the adversarial validation program. Approves the annual plan. Reviews systemic analysis outputs. |
| CISO | Approves the annual plan. Authorizes red-team and OT engagements. Receives Critical finding notifications. |
| Engineering Pillar Leader | Receives and acts on design, architecture, and configuration systemic findings. Participates in scoping. |
| Governance Pillar Leader | Receives standard and procedure gap systemic findings. Coordinates catalog amendments resulting from systemic improvements. |
| Detection Engineer | Receives detection gap findings. Updates detection rules and reports on coverage improvement. |
| Exposure Management Lead | Receives specific vulnerability findings. Tracks remediation to closure per PRC-VM-001. |
| Risk Register Owner | Records material findings as risk register entries. Links risk entries to improvement entries when systemic. |
| OT Safety Officer | Named for each OT engagement. Holds binding stop authority. Reviews OT scope and safety constraints. |
13. Evidence Retention
| Artifact | Retention | Access |
|---|---|---|
| Signed RoE | Engagement + 7 years | Sponsor, CISO, Audit |
| Signed Charter (red team) | Engagement + 7 years | Sponsor, CISO, Audit |
| Engagement plan / scope | Engagement + 5 years | Sponsor, Test Lead, CISO |
| Tester daily logs / standup notes | Engagement + 2 years | Test Lead, CERG sponsor |
| Final report (sanitized) | Engagement + 7 years | Distribution per RoE |
| Final report (full / unsanitized) | Engagement + 5 years | Named distribution only |
| Detection validation results (purple) | Engagement + 3 years | Detection eng + audit |
| Finding records | Per VM tool retention; closed findings retained per audit retention | - |
| Retest evidence | Per finding record retention | - |
Final reports are distributed under the confidentiality terms in the RoE. Full reports are not emailed; access is via a controlled location with audit logging.
Findings Carry Sensitive Information About How to Compromise Us
An unsanitized pen-test report is a road map for attackers and should be treated as Restricted. Distribution is named, encrypted, and audited; “wider sharing” requests are reviewed by the Engagement Sponsor.
14. Regulatory and Framework Alignment Summary
| Regulation / Framework | Where in This Procedure |
|---|---|
| NIST 800-53r5 CA-8 | All sections |
| NIST 800-115 | All sections |
| PTES | Sections 3–4 |
| MITRE ATT&CK | Sections 6, 7 |
| OWASP WSTG / MASTG / API Top 10 | Section 8 |
| NERC-CIP CIP-007 R3.2 (vulnerability assessment) | Sections 9, 10 |
| CMMC L2 (3.11.2) | Sections 3, 8 |
15. Document Control
| Document ID | CERG-PRC-AV-001 |
| Version | 1.2 |
| Approved By | CISO |
| Next Review | Annual / on material program change |
| Change Log | 1.0 - Initial publication. Engagement types, RoE, charter, purple, cloud, app, OT safety, rating/routing, evidence retention. · 1.1 - Added Section 11 Systemic Analysis and Program Feedback Loop: post-engagement systemic analysis, systemic finding routing, prevention verification, and annual trend analysis. · 1.2 - Restored Section 12 Roles and Responsibilities. Renumbered Evidence Retention to Section 13, Regulatory to 14, Document Control to 15. |
Source: procedures/CERG-PRC-AV-001_Adversarial_Validation_Procedure.md ·
Download .md ·
View on GitHub