ADVERSARIAL VALIDATION PROCEDURE

Pen Test · Red Team · Purple Team · Cloud Attack Sim · App Test · OT Safety · Finding-to-Closure


Document ID	CERG-PRC-AV-001
Version	1.2
Status	Approved
Classification	Public
Owner	Adversarial Testing Lead
Parent Policy	CERG-POL-001 - Cybersecurity Policy
Supporting Documents	CERG-GOV-CB-001 · CERG-STD-LM-001 · CERG-STD-OT-001 · CERG-STD-CUI-001 · CERG-PRC-VM-001 · CERG-PRC-RM-001 · CERG-PRC-AR-001 · CERG-PRC-TPRM-001 · CERG-PLN-IR-001 · CERG-GOV-CEF-001 · CERG-GOV-IMPREG-001 · CERG-PRC-LL-001
Review Cycle	Annual / On material program change
Frameworks	NIST 800-53r5 (CA-8) · NIST 800-115 · PTES · MITRE ATT&CK · MITRE D3FEND · OWASP WSTG / MASTG
Regulations	NERC-CIP CIP-007 (vulnerability assessment) · CMMC L2 (3.11.2) · SOX (indirect via control testing)
Environments	All - IT · cloud · SaaS · OT (with safety constraints) · application · identity

Purpose and Scope
Engagement Types
Annual Cadence and Scoping
Rules of Engagement (RoE) Template
Red-Team Engagement Charter
Purple-Team Detection Validation
Cloud Attack Simulation
Application Penetration Test Checklist
OT Adversarial Assessment, Safety Protocol
Finding Rating, Routing, and Retest
Systemic Analysis and Program Feedback Loop
Roles and Responsibilities
Evidence Retention
Regulatory and Framework Alignment Summary
Document Control

1. Purpose and Scope

The Cybersecurity Policy requires periodic adversarial testing. The exposure management procedure names the adversarial activities CERG performs. The risk taxonomy distinguishes adversarial testing from vulnerability scanning. Until this procedure, the operating model, scope, rules, safety, finding routing, evidence, for adversarial validation was implicit.

This procedure makes it explicit. It applies to every CERG-led or CERG-overseen adversarial engagement: external pen test, internal pen test, cloud red-team / attack simulation, web/mobile/API pen test, OT adversarial assessment, purple-team detection validation, and social-engineering work conducted with the adjacent Awareness function.

In CERG, penetration testing is one engagement type within the broader adversarial validation capability. It does not replace red-team operations, purple-team detection validation, cloud attack simulation, OT-safe assessment, or other authorized testing where those activities are in scope.

Adversarial Validation Is the Antidote to Drift

A control library that exists on paper but has never been tested is a hope. Adversarial validation tests the assumption that the control library implements actual defense. CERG runs it on a cadence; documents the rules; routes findings into exposure management or the risk register; and uses purple-team work to upgrade detection, not just to score the team.

2. Engagement Types

Type	Goal	Default Frequency	Adversary Model
External Pen Test	Validate the perimeter - internet-facing surface, edge identity, public services.	Annual	Unauthenticated external attacker.
Internal Pen Test	Validate post-foothold lateral movement, privilege escalation, segmentation.	Annual	Authenticated low-privilege user / compromised workstation.
Cloud Red Team / Attack Sim	Validate cloud control plane, identity, KMS, logging detection.	Annual + on material cloud change	Compromised cloud identity / supply-chain attack on cloud workloads.
Application Pen Test	Validate web / mobile / API surface of in-scope applications.	Annual for T1/T2 apps; risk-based for others; pre-launch for new apps.	Authenticated and unauthenticated.
OT Adversarial Assessment	Validate OT boundary, ESP/EAP, jump server, OT detection set.	Per CIP-007 cadence (at least every 15 months for BES)	Compromised IT-side identity attempting OT pivot; vendor-channel compromise.
Purple-Team Detection Validation	Validate the day-one + tuned detection set fires as designed.	Quarterly	ATT&CK technique emulation.
Social Engineering	Validate identity and awareness controls; phishing-resistance.	Per Awareness program	External commodity + targeted scenarios.
Tabletop / Crisis Simulation	Validate IR plan + CERG interfaces during a simulated incident.	Annual	Scenario-driven.

2.1 External Tester Management

When adversarial validation is conducted by an external firm rather than internal staff, the following requirements apply.

Minimum Tester Qualifications

Engagement Type	Minimum Qualification
Network / infrastructure penetration test	OSCP, GPEN, or equivalent; 2+ years of hands-on testing experience
Web application penetration test	OSWE, GWAPT, or equivalent; demonstrated web app testing portfolio
Red team engagement	GXPN, OSEP, or equivalent; 5+ years of adversarial emulation experience
Cloud security assessment	Cloud-specific certification (AWS Security, Azure Security, GCP Professional Security) or equivalent
OT / ICS assessment	GICSP, GRID, or equivalent; operational technology experience with safety-aware methodology
Social engineering engagement	SE-specific training or certification; understanding of legal and ethical boundaries

Selection and Vetting Requirements

Requirement	Detail
Conflict of interest disclosure	Firm must disclose any prior or current relationship with the organization’s competitors, vendors, or key personnel that could compromise objectivity
Background checks	All testers assigned to the engagement must have passed a background check within the last 12 months; evidence of clearance retained by the Vendor Risk Analyst
Insurance	Firm must carry professional liability / errors and omissions insurance appropriate to the engagement scope; certificate of insurance on file
References	At least two references from engagements of comparable scope and sensitivity in the last 24 months

Vendor Rotation Policy

External testing vendors are rotated at minimum every 3 years for each scope area (network, application, cloud, OT, social engineering).
Rotation applies to the firm, not individual testers.
Exceptions require CISO approval and are documented in the risk register.

Contractual Requirements

Non-disclosure agreement (NDA) covering all findings, data accessed, and methods used
Data handling: all customer data accessed during testing is treated as Restricted; no data leaves the organization’s control without explicit authorization
Evidence destruction: upon engagement closure or within 30 days, the firm certifies destruction of all engagement data, findings drafts, and tool output in their possession
Right-to-audit: the organization reserves the right to audit the firm’s handling of engagement data
Notification of subcontractors: the firm must disclose any subcontractors and their qualifications before engagement start

3. Annual Cadence and Scoping

CERG produces an annual Adversarial Validation Plan that schedules the engagements above against the in-scope environments. The plan is reviewed and approved by the CISO; it is updated quarterly to reflect emerging threats and program priorities.

Scoping inputs:

Highest-residual-score risks in the register.
Recently introduced systems and architectures (CERG-PRC-AR-001 Phase 5 records).
Regulatory cadence requirements (CIP, CMMC).
ATT&CK coverage gaps identified by detection engineering.
Threat intelligence pointing at adversary activity relevant to the organization.

4. Rules of Engagement (RoE) Template

Every engagement has a signed RoE before any activity begins. The template:

RULES OF ENGAGEMENT - <Engagement Name>      AV-YYYY-NNNN

1. ENGAGEMENT IDENTITY
   Type                  : (External / Internal / Cloud / App / OT / Purple / SE / TT)
   Sponsor (named role)  :
   Test Lead             :
   Period                :
   Time Windows          :

2. OBJECTIVES
   What we are testing   :
   Success criteria      :
   Risk-register / control gaps this engagement is intended to validate

3. SCOPE - IN
   IPs / URLs / Apps / Cloud accounts / OT segments / Identity tenants explicitly in scope

4. SCOPE - OUT
   Explicit out-of-scope: e.g., production-impacting denial-of-service, mass account lockouts, third-party-hosted out-of-scope assets

5. PROHIBITED ACTIONS
   (Defaults below; engagement may add. None may be removed.)
   - No denial-of-service against production
   - No data destruction or modification
   - No persistence beyond engagement window
   - No exfiltration of regulated data (CUI / BCSI / PHI / PII); proof-of-concept counts; data is staged and discarded
   - No exploitation of safety-impacting OT systems (Section 9)
   - No social engineering of named C-suite without explicit prior approval
   - No physical intrusion outside agreed scope

6. SAFETY AND ABORT
   Stop conditions       : (defined; e.g., OT process alarms, anomalous business impact)
   Abort signal          : (named individual + backup; mechanism - phone / Signal channel)
   Operator on-call (OT) : (if OT)
   Incident handoff path : if findings cross the "test" → "real incident" threshold

7. AUTHORIZATION
   Authorization letter signed by   :
   Distribution                     :
   Authority to act in our environment

8. COMMUNICATIONS
   SOC informed?         : Yes - passive (no detection assistance) / Yes - active (purple)
   Liaison on Tester side:
   Liaison on CERG side  :
   Reporting cadence     : (daily standups / weekly status / final report)

9. EVIDENCE COLLECTION
   What is captured      :
   How it's stored       :
   What is sanitized in final report

10. CONFIDENTIALITY
    Findings classification (default: Restricted)
    Handling : encrypted at rest, named distribution, no email of unredacted

11. SIGNATURES
    Tester Lead          :
    CERG Sponsor         :
    System / Business Owner :
    CISO (red team / OT / cloud red team) :

5. Red-Team Engagement Charter

Red-team engagements are scope-broader, time-longer, goal-driven simulations of a realistic adversary. Each red team has a Charter in addition to its RoE.

RED-TEAM CHARTER - <Engagement Name>     AV-YYYY-NNNN

A. ADVERSARY MODEL
   Threat actor profile        :
   Capability                  : (commodity / advanced / nation-state-equivalent)
   Resourcing (time, tools, ops infrastructure)

B. OBJECTIVES - IN PRIORITY ORDER
   Crown Jewel(s) to attempt to reach
   Detection escape required?  :
   Persistence required?       :
   Specific TTPs to emulate    : (ATT&CK technique IDs)

C. ENGAGEMENT WINDOW
   Start / End                 :
   No-Op windows               : (financial close, regulatory exam, major event)

D. PRE-AGREED LIMITS
   Production-impact threshold :
   OT touch                    : (forbidden unless explicit)

E. WHITE-CELL
   Named coordinators on both sides; channels for clarification without breaking the test

6. Purple-Team Detection Validation

Purple work is the collaboration between offensive and detection engineering. It is not a graded exam of the SOC; it is a measurement of detection coverage and the basis for upgrading rules and baselines.

6.1 Cadence and Coverage

Quarterly purple-team cycle covering a rotating slice of the ATT&CK matrix in scope.
Annual full-scope purple-team across the IT spine; semi-annual on cloud sub-matrix; annual on OT sub-matrix.
Each cycle has 8–20 techniques tested; the cycle’s selection is risk-driven (recent threat intel, recent infrastructure changes, prior-cycle failures).

6.2 Validation Workflow

Plan, choose techniques (ATT&CK IDs); map to expected detections in the SIEM; document expected alert.
Execute, emulate technique in non-production where possible; in production with controlled, documented action where required.
Observe, did the expected detection fire? With expected priority? In expected time?
Classify, Pass · Pass-with-Latency · Pass-with-Tuning-Needed · Fail (no detection) · Fail-Suppressed (existed but suppressed).
Action, Pass: no change. Pass-with-Tuning: tune. Fail / Fail-Suppressed: open detection backlog item + risk register entry until restored.
Reflect, feed result into detection inventory; update DT-002 metric in CERG-GOV-MTR-001.

6.3 Purple-Team Outcome to Baseline

The Purple Cycle Updates Detection, Not Just the Scorecard

If a technique fires nine times out of ten because logs were missing for one source, the fix is to fix the source, not to re-grade the test. If a technique fires reliably but the alert routes to the wrong queue, the fix is the queue. Purple cycles end with a list of concrete changes to the detection set; if they don’t, the cycle is incomplete.

7. Cloud Attack Simulation

Cloud engagements emulate adversary activity at the cloud control plane and identity layer, historically the most-attacked surface in modern enterprises.

Focus	Examples
Identity	OAuth consent grant abuse; consent phishing; token theft; federation tampering.
Cloud control plane	IAM privilege enumeration and escalation; CloudTrail / Activity Log disable attempts; key policy changes; cross-account assumption.
KMS / Secrets	Unauthorized key access; envelope-encryption bypass attempts.
Workloads	Container escape; metadata service abuse; misconfigured serverless.
Logging / monitoring	Visibility evasion - disable, alter, route around.

Cloud engagements explicitly include detection validation; CSPM/SSPM signals and cloud-native detection (GuardDuty, Defender for Cloud, Security Command Center) are exercised.

8. Application Penetration Test Checklist

Application tests use OWASP WSTG (web), MASTG (mobile), and API Security Top 10 as the technical reference. The checklist below is the minimum coverage at scoping; specific apps may need more.

Domain	Coverage
Authentication	MFA enforcement, credential stuffing resistance, password-reset flow, session handling.
Authorization	IDOR, function-level access, multi-tenant isolation, RBAC enforcement.
Input handling	Injection (SQL, command, template), deserialization, file upload, SSRF.
Output / data exposure	Information disclosure, sensitive data in responses, error verbosity.
Configuration	Headers (HSTS, CSP, etc.), TLS conformance to `CERG-STD-CR-001`, exposed admin endpoints.
Business logic	Workflow bypass, race conditions, abuse of intended functionality.
API	OWASP API Top 10: object-level, function-level, mass assignment, rate limiting, etc.
Mobile	OWASP MASTG: storage, transport, platform interaction, code quality, resilience.

Social engineering tests the human layer of security controls. These engagements carry unique legal, ethical, and operational risks that require explicit guardrails.

Scope Definition

Engagement Subtype	Description	Typical Scope
Phishing simulation	Simulated phishing email campaign to test user awareness and reporting behavior	Organization-wide or targeted departments
Vishing (voice phishing)	Simulated phone-based social engineering to extract information or credentials	Named individuals or departments with CISO approval
Smishing (SMS phishing)	Simulated text message-based social engineering	Named individuals or departments with CISO approval
Physical social engineering	Tailgating, badge cloning, or pretext-based physical access attempts	Named facilities with Facility Security coordination
USB drop	Simulated malicious USB devices placed in common areas	Named facilities with Facility Security coordination

Pretexting Rules and Limitations

Pretexts must not impersonate law enforcement, government officials, or regulatory authorities.
Pretexts must not exploit personal tragedy, medical conditions, or family emergencies.
Pretexts must not target minors or non-employees.
Pretexts must not threaten employment status, compensation, or benefits.
All pretexts are defined in the RoE and approved by the Engagement Sponsor.

Prohibited Targets (require explicit CISO approval)

C-suite executives and their direct administrative staff
Board members
Legal counsel (internal and external)
Human Resources personnel
Personnel on leave (medical, family, or other protected leave)

Coordination Protocol

Stakeholder	When	Purpose
Legal	Before RoE approval	Review pretexts for legal risk; confirm no violation of wiretapping, recording, or privacy laws
HR	Before RoE approval	Confirm no employee relations risk
Facility Security	Before physical or USB engagements	Coordinate access testing; prevent alarm triggers or law enforcement response
SOC / Detection Team	Before go-live	Prevent the engagement from being treated as a real incident
Incident Commander	Before go-live	Pre-brief: if engagement triggers detection, the IC knows it’s a test

Reporting Requirements

Social engineering findings are reported separately from technical findings. The report includes:

Engagement subtype and scope
Number of targets and interactions
Success rate per subtype and per department
Root cause themes (e.g., lack of verification procedure, urgency exploitation, authority bias)
Recommended awareness and control improvements
Comparison to prior social engineering engagements (if applicable)

9. OT Adversarial Assessment: Safety Protocol

OT testing is uniquely constrained. Safety overrides any test objective.

9.1 Prohibited Without Explicit Approval

The following are prohibited absent a documented exception with engineering and operations sign-off:

Active scanning of live SCADA/HMI/RTU/relay surfaces. Passive monitoring and engineering-supervised authenticated checks only.
Exploitation that could affect setpoints, alarms, relays, breakers, or control logic.
Denial-of-service against any OT segment, including saturation testing.
Disabling of OT monitoring or logging.
Persistence on OT systems.
Engagement during operational events: storms, demand peaks, planned outages, regulatory windows.

9.2 Permitted Approaches

IT-side validation of paths into OT (jump server, vendor remote access, IT/OT crossing).
Lab / digital-twin replication of OT systems for offensive testing.
Passive observation and documentation of OT exposure.
Engineering-supervised authenticated checks in a defined window.
Tabletop / scenario-based assessment with OT operators.

9.3 Authority and Safety Officer

Every OT engagement has a named OT Safety Officer (an operator with substation / process engineering authority) who can stop the test for any reason without justification. The Safety Officer’s “stop” is binding.

The Cost of Getting OT Testing Wrong

A pen-test misstep on an enterprise web app produces a ticket. A misstep on a relay setting group can shed load or trip a substation. CERG conducts OT adversarial work with the same care a power engineer applies in the field, and structures the engagement so the engineer’s call always wins.

10. Finding Rating, Routing, and Retest

10.1 Rating

Findings are rated using a uniform schema regardless of engagement type, aligning to CERG-PRC-VM-001 severity bands so they can co-route into exposure management:

Severity	Definition
Critical	Direct path to Crown Jewel, regulated data, safety impact, or persistent foothold.
High	Significant escalation or material exposure of sensitive data; widespread bypass of a control.
Medium	Notable control gap with limited exploitability or limited scope.
Low	Hygiene or hardening recommendation.
Informational	Observations and recommended improvements not tied to a control gap.

10.2 Routing

Source of Finding	Routes To
Specific software / asset vulnerability	`CERG-PRC-VM-001` finding + risk register entry per `CERG-PRC-RM-001` if Critical/High
Control / design weakness	Risk register entry directly per `CERG-PRC-RM-001`
Detection gap	Detection backlog + risk register if material
Process / runbook weakness	Owning procedure (AR / IR / TPRM) + risk register if material
Vendor / supply chain weakness	TPRM record per `CERG-PRC-TPRM-001`
OT-specific weakness	OT exposure management workflow per `CERG-PRC-VM-001` §4.3 and §6.3 + risk register

10.3 Retest and Closure

Critical findings retested within 30 days of remediation claim.
High findings retested within 60 days.
Medium/Low retested at next routine engagement or by sampled VM scan.
A finding moves to Closed only after retest demonstrates remediation; a finding that cannot be remediated moves through the risk register acceptance path with explicit Approver.
Closed findings inform the threat model for the affected system at next architecture review.

10.4 Escalation for Stalled Remediation

When remediation stalls beyond the retest window, the finding escalates.

Escalation Triggers

Trigger	Action
Critical finding not remediated at 30 days	Adversarial Testing Lead escalates to Risk Pillar Leader
Critical finding not remediated at 45 days	Risk Pillar Leader escalates to CISO
Critical finding not remediated at 60 days	CISO escalates to Executive; risk acceptance required
High finding not remediated at 60 days	Adversarial Testing Lead escalates to Risk Pillar Leader
High finding not remediated at 90 days	Risk Pillar Leader escalates to CISO
Medium finding not remediated at 120 days	Adversarial Testing Lead escalates to system owner’s pillar leader

Escalation Ladder

Adversarial Testing Lead → Risk Pillar Leader → CISO → Executive Sponsor

At each level, the responsible party must either (a) produce a credible remediation plan with a committed date, (b) initiate a risk acceptance through CERG-PRC-RM-001, or (c) provide a written rationale. Non-response at any level automatically escalates after 5 business days.

Risk Acceptance as the Only Alternative

When remediation is not feasible, risk acceptance requires: a written risk statement per PRC-RM-001 §4.1, documented compensating controls, approval per the authority matrix, and a scheduled re-review date. A finding that is neither remediated nor risk-accepted is a program gap, not an operational delay.

10.5 Adversarial Validation Final Report

Every engagement produces a Final Report - the authoritative record of what was tested, found, and recommended.

Required Sections

Section	Content
Executive Summary	Engagement type, scope, dates; findings summary by severity; top three risks; overall posture assessment; recommended next actions
Engagement Overview	RoE reference; scope (systems, networks, applications); out-of-scope items; timeline; tester(s); methodology summary
Methodology	Reconnaissance and enumeration techniques; exploitation approach; post-exploitation and lateral movement; tools and frameworks used
Findings	Each finding includes ID, severity, title, description, affected assets, evidence (annotated screenshots, command output), risk statement, remediation recommendation, references (CVE, ATT&CK, CWE)
Systemic Analysis	Patterns across findings; control themes that failed; root cause candidates
Trend Analysis (if applicable)	Comparison to prior engagements; findings closed since last engagement; new vs. recurring findings; trend direction
Appendices	Evidence artifacts, exploit code references (sanitized), timeline of key events, tools inventory

Evidence Artifact Standards

Screenshots: Annotated with arrows or captions highlighting relevant detail; include timestamp and host identifier
Command output: Full command and output with timestamp; truncation acceptable with notation
Packet captures: Time-stamped with source/destination; filtered to relevant conversation
Exploit code: Referenced by tool and version, not reproduced in full

10.6 Inter-Rater Reliability for Finding Ratings

Different testers may rate the same finding differently. The following process ensures severity ratings are consistent and defensible.

Peer Review for High-Severity Findings

All Critical and High findings receive a peer review by a second qualified tester before the finding is finalized. The peer reviewer validates: - The severity rating against the definitions in Section 10.1 - That the evidence supports the severity determination - That the finding is not a duplicate of a known issue

If the peer reviewer disagrees with the severity, the Adversarial Testing Lead facilitates resolution. The final severity is the consensus rating, or the higher of the two if consensus cannot be reached.

Calibration Sessions

Quarterly calibration sessions are conducted using a sample of anonymized findings from recent engagements. All testers independently rate each finding. Results are compared and discussed. The purpose is to: - Align severity interpretation across testers - Identify systematic over-rating or under-rating patterns - Refine severity definitions where ambiguity is found

The Adversarial Testing Lead facilitates calibration and documents outcomes. Calibration results inform tester development and, where appropriate, updates to this procedure’s severity definitions.

Dispute Resolution

Dispute	Resolution
Tester and peer reviewer disagree on severity	Adversarial Testing Lead facilitates; default to higher severity if unresolved
Tester and system owner disagree on severity	Risk Pillar Leader reviews; severity determined by risk to the organization, not by remediation difficulty
Tester and CISO disagree on Critical classification	CISO makes final determination; rationale documented

11. Systemic Analysis and Program Feedback Loop

An Adaptive program does not stop at fixing individual pen-test findings. It asks: why did four of six DMZ servers share the same misconfiguration? What standard or procedure failed to prevent this? How do we change the program so the next engagement does not repeat the same class of finding?

This section defines the post-engagement systemic analysis that converts adversarial validation findings into program improvements.

11.1 Post-Engagement Systemic Analysis

After every adversarial validation engagement, the Adversarial Testing Lead produces two outputs: a findings report (existing) and a systemic analysis (new). The systemic analysis answers four questions:

Which findings share a common root cause? Example: four findings relate to default credentials on newly provisioned systems : the provisioning process does not enforce credential rotation. This is not four separate problems; it is one systemic weakness manifesting in four locations.
Which findings indicate a standard or procedure gap? Example: DMZ web servers all had the same TLS misconfiguration : STD-CFG-001 or STD-NET-001 does not specify DMZ TLS requirements clearly enough.
Which findings indicate a control effectiveness gap? Example: WAF was deployed per CB-001 but three bypass techniques worked : the WAF control is Implemented but not effective (per CEF-001).
Which findings indicate a detection gap? Example: testers pivoted laterally for four hours without an alert : detection rules need updating for the observed technique chain.

11.2 Systemic Finding Routing

Each systemic finding is routed to the appropriate owner for program-level action:

Systemic Finding Type	Routed To	Tracking Mechanism
Standard or procedure gap	Governance Pillar Leader (Policy & Standards Manager)	IMPREG-001 (Type: Standard revision or Procedure update)
Control effectiveness gap	Accountable pillar leader (per CB-001)	IMPREG-001 (Type: Control amendment) + CEF-001 control failure analysis
Detection gap	Risk Pillar Leader (Detection Engineer)	Detection backlog per LM-001 + IMPREG-001 if systemic
Process gap	Accountable procedure owner (per RAC-001)	IMPREG-001 (Type: Procedure update)
Tooling or capability gap	Relevant pillar leader	IMPREG-001 (Type: New capability) + CISO if budget required

Routed systemic findings are tracked in the improvement register (IMPREG-001) per the improvement lifecycle. The source field records the engagement ID and date.

11.3 Prevention Verification

At the next adversarial validation engagement of the same type, scope, or target environment, the engagement plan includes re-testing the systemic weaknesses identified in the prior engagement.

If the same class of finding does NOT recur: the systemic improvement is verified Effective in IMPREG-001.
If the same class of finding recurs: the improvement is marked Ineffective or Partially Effective, re-opened in IMPREG-001, and the root cause analysis is re-examined. The fact that a deliberate program change did not prevent recurrence is itself a critical lesson.

The Recurrence Test

An assessor reviewing two consecutive pen test reports should see: (a) the specific vulnerabilities from the prior test are closed, AND (b) the systemic weakness that produced them no longer produces new findings of the same class. If the second report finds different specific vulnerabilities but the same systemic root cause : different default credentials on different systems : the improvement failed. The program that only fixes individual findings and never the systemic cause is operating at Repeatable, not Adaptive.

11.4 Annual Trend Analysis

Annually, the Adversarial Testing Lead produces an adversarial validation trend analysis covering all engagements in the trailing 12 months:

Are total findings increasing or decreasing? Segmented by severity.
Are systemic weaknesses repeating across engagements?
Which control families (per CB-001) produce the most findings? The most Critical and High findings?
Which control families show improvement (fewer findings year-over-year)?
How many systemic findings were routed to IMPREG-001, and what was their verification outcome?

The trend analysis feeds into: - The control effectiveness program (CEF-001): control families with persistently high finding rates trigger a control failure analysis - The risk appetite calibration (RMF-001): a finding-rate trend that undermines a risk appetite assumption triggers recalibration - The annual maturity assessment (MAT-001): finding-rate trends are evidence of control effectiveness or ineffectiveness

12. Roles and Responsibilities

Role	Responsibility
Adversarial Testing Lead	Owns this procedure. Produces the annual plan, engagement RoE/Charter, findings report, and systemic analysis. Routes systemic findings. Produces the annual trend analysis.
Risk Pillar Leader	Accountable for the adversarial validation program. Approves the annual plan. Reviews systemic analysis outputs.
CISO	Approves the annual plan. Authorizes red-team and OT engagements. Receives Critical finding notifications.
Engineering Pillar Leader	Receives and acts on design, architecture, and configuration systemic findings. Participates in scoping.
Governance Pillar Leader	Receives standard and procedure gap systemic findings. Coordinates catalog amendments resulting from systemic improvements.
Detection Engineer	Receives detection gap findings. Updates detection rules and reports on coverage improvement.
Exposure Management Lead	Receives specific vulnerability findings. Tracks remediation to closure per PRC-VM-001.
Risk Register Owner	Records material findings as risk register entries. Links risk entries to improvement entries when systemic.
OT Safety Officer	Named for each OT engagement. Holds binding stop authority. Reviews OT scope and safety constraints.

13. Evidence Retention

Artifact	Retention	Access
Signed RoE	Engagement + 7 years	Sponsor, CISO, Audit
Signed Charter (red team)	Engagement + 7 years	Sponsor, CISO, Audit
Engagement plan / scope	Engagement + 5 years	Sponsor, Test Lead, CISO
Tester daily logs / standup notes	Engagement + 2 years	Test Lead, CERG sponsor
Final report (sanitized)	Engagement + 7 years	Distribution per RoE
Final report (full / unsanitized)	Engagement + 5 years	Named distribution only
Detection validation results (purple)	Engagement + 3 years	Detection eng + audit
Finding records	Per VM tool retention; closed findings retained per audit retention	-
Retest evidence	Per finding record retention	-

Final reports are distributed under the confidentiality terms in the RoE. Full reports are not emailed; access is via a controlled location with audit logging.

Findings Carry Sensitive Information About How to Compromise Us

An unsanitized pen-test report is a road map for attackers and should be treated as Restricted. Distribution is named, encrypted, and audited; “wider sharing” requests are reviewed by the Engagement Sponsor.

14. Regulatory and Framework Alignment Summary

Regulation / Framework	Where in This Procedure
NIST 800-53r5 CA-8	All sections
NIST 800-115	All sections
PTES	Sections 3–4
MITRE ATT&CK	Sections 6, 7
OWASP WSTG / MASTG / API Top 10	Section 8
NERC-CIP CIP-007 R3.2 (vulnerability assessment)	Sections 9, 10
CMMC L2 (3.11.2)	Sections 3, 8

15. Document Control


Document ID	CERG-PRC-AV-001
Version	1.2
Approved By	CISO
Next Review	Annual / on material program change
Change Log	1.0 - Initial publication. Engagement types, RoE, charter, purple, cloud, app, OT safety, rating/routing, evidence retention. · 1.1 - Added Section 11 Systemic Analysis and Program Feedback Loop: post-engagement systemic analysis, systemic finding routing, prevention verification, and annual trend analysis. · 1.2 - Restored Section 12 Roles and Responsibilities. Renumbered Evidence Retention to Section 13, Regulatory to 14, Document Control to 15.

Source: procedures/CERG-PRC-AV-001_Adversarial_Validation_Procedure.md · Download .md · View on GitHub