Back to Advanced Workshop (Locked Tenant Reality)
Week 4.5 · Course 4.5 · Reality Track

Advanced Workshop
(Locked Tenant Reality)

Same six modules. Same frontier-mapping, complex-build, group-debugging, QA, teach-back, and playbook arc as Course 4. Different execution layer (static HTML) and a second enterprise AI tool on the desk (Ask Sage alongside genai.mil).

Length4 hours (with one 10-min break)
FormatLive workshop, bring builds in progress
PrereqCourse 3.5 or one deployed static-stack tool

What Course 4 taught. What 4.5 keeps. What 4.5 pivots.

Same arc. Different stack. One additional tool on the desk.

Course 4 taught

  • Frontier Mapping for your domain
  • Complex multi-component Build (Power BI dashboard)
  • Group Debugging clinic, five problems in 35 minutes
  • Verification protocols and QA drill
  • Teach-Back, the 201 multiplier
  • Workflow Playbook, one recurring task end to end

4.5 keeps the arc and pivots the stack

  • Keeps: six modules, same skills, same assessment rubric
  • Pivots: the Power BI/Power Apps/Power Automate build to single-file static HTML
  • Adds: Ask Sage as a peer enterprise AI tool alongside genai.mil
  • Adds: the meta-frontier (tool selection) as a first-class skill
  • Adds: the two-tool cross-check protocol for high-stakes work
Recap, Course 4 to 4.5
2

The Two-Tool World.

Two CUI-authorized enterprise AI tenants. Pick deliberately. New for 4.5.

genai.mil · fast chat

  • Use when: single-prompt drafts, quick code snippets, fast bounces, frontier classification
  • Strengths: low setup cost, fast turn time, always your default for one-shot work
  • Weakness: context window blows out across long multi-file work; constraints drift after many turns
  • Today: Build phases 2 and 3 prompts, debug bounces, teach-back prep

Ask Sage · multi-file reasoning

  • Use when: multi-file refactor, dataset analysis with multiple CSVs, architecture trade-offs, verification cross-checks
  • Strengths: file ingestion, reasoning models (Claude Opus 4.7 or a GPT-5 reasoning), holds schema across the conversation
  • Weakness: overkill and slow for one-shot drafts; higher setup cost per turn
  • Today: Build phase 1 architecture, phase 4 verification, the two-tool cross-check

Today's discipline: mode-switching (centaur vs cyborg) AND tool-switching (genai.mil vs Ask Sage). Two independent axes.

The Two-Tool World
3

Three layers. Two we always have. One we sometimes don't.

Carries from 3.5. Spec is now BOTH genai.mil and Ask Sage.

1. Spec

genai.mil for fast chat. Ask Sage for multi-file reasoning. Both CUI-authorized. Think, design, prompt, decompose, cross-check.

Always available where granted.

2. Prototype

Single-file static HTML. Deployed to a free public host MCEN can reach. Inline CSS, inline JS, no CDN.

Always available.

3. Production

Power Apps, Dataverse, Power Automate, real connectors, identity, write-back to systems of record.

Often blocked. The Capability Gap Map layer.

Work every layer we can reach. Document the wall where the work stops. The wall map is the case for changing what we have access to.
The Three Layers
4

Today's mission. Three deliverables.

By 4 hours from now, every Marine in this room produces all three.

  • Deployed Readiness Dashboard: a single-file static HTML tool with three joined data sources, hand-rolled SVG bars, Not-Ready table, and a plain-text Commander's Snapshot. Public URL.
  • Advanced Frontier Map: at least three rows specific to what bit you today, split by tool (genai.mil column, Ask Sage column).
  • Capability Gap Map entries: at least two new rows mapping your Workflow Playbook to a production-layer wish with the specific blocking permission named.

The tool is replaceable. The judgment about which tool to use, and how to verify what it tells you, is what we are actually teaching.

Today's Mission
5

Agenda · 4 hours.

Six modules. One 10-minute break. The complex build is heaviest by design.

TimeModuleDurationMode & tool focus
0:00 to 0:30M1, Frontier Mapping for your domain30 minWorkshop · both tools
0:30 to 1:30M2, Complex Build, Readiness Dashboard60 minLive build · both tools
1:30 to 1:40Break10 min
1:40 to 2:20M3, Group Debugging40 minClinic · 5 scenarios
2:20 to 2:50M4, Verification & QA, two-tool cross-check30 minReference + drill
2:50 to 3:20M5, Teach-Back, the 201 multiplier30 minWorkshop
3:20 to 3:50M6, Workflow Playbook + Capability Gap Map30 minWorkshop
3:50 to 4:00Wrap, rubric, next steps10 min
Agenda
6

Refresher. 201 skills plus frontier classification.

Quick calibration before Module 1. Everyone in this room already has these.

The six 201 skills

  • Context Assembly
  • Quality Judgment
  • Task Decomposition
  • Iterative Refinement
  • Workflow Integration
  • Frontier Recognition

Three categories of failure

  • Context: AI did not have enough information. Fixable with a better prompt.
  • Frontier: the model cannot do this task reliably right now. Document on the map.
  • Meta-frontier: wrong tool reached for. Fixable by switching tools. New for 4.5.
Refresher
7
Module 1 of 6

Frontier Mapping for Your Domain.

30 minutes

Make the boundary between AI helps and AI hurts visible for your specific work, in writing, shareable, split by tool.

Open instructor notes for this section ↗
Module 1 · Frontier Mapping
8

Why this map is the most valuable thing you'll make today.

Performance drops 19 percentage points when workers apply AI outside the frontier vs working without AI at all. BCG-Harvard, 758 consultants, 2023.

What that means today

  • The boundary is real, and it is not where intuition puts it.
  • People do not notice when they cross it. Quality collapses silently. The output still looks right.
  • Each tool has its own boundary. genai.mil and Ask Sage have different frontiers in your domain.
  • The map makes the boundary visible to your section.
  • Most valuable artifact you will leave with today.
Module 1 · Why It Matters
9

Tool Handles · what each tool is good for.

Two columns, one per tool. Yours will look like this for your own domain.

Task genai.mil handles Ask Sage handles
Single-prompt drafts (counseling, awards, memos) Yes. Fast chat is the right tool. One prompt in, one polished draft out. Overkill. Works, but the file-ingestion overhead is not justified for one-shot drafts.
Multi-file refactor of a static-stack tool Painful. Context window blows out across iterations. Common to see the model lose track across 20+ turns. Yes. Upload the files, ask Claude Opus 4.7 (or another reasoning model the tenant exposes) for a targeted refactor with constraints stated up front.
Quick code snippet (regex, date math) Yes. Bounce a single-line prompt, paste the result, verify. Works, but slow for what should be a 10-second answer.
Dataset analysis with three CSVs (readiness, training, equipment) Limited. Can do it with paste-and-summarize on small samples. Loses fidelity at scale. Yes. Upload all three files. Use a reasoning model. Holds schema across the conversation.
Architecture decision (data model, schema migration plan) Possible for small scope. Reasoning quality drops on larger trade-off questions. Yes. A reasoning model produces the best architecture analysis the locked-tenant world currently exposes.
Verification cross-check (same prompt through both, look for disagreement) Half of the two-tool cross-check. Use for the fast side. Half of the two-tool cross-check. Use for the deliberate side. Disagreements are flags.
Module 1 · Tool Handles
10
Workshop · Switch to editor · Now

Open your Frontier Map template. Start filling.

15 minutes · silent work, then a 10-minute live debrief
Prompt Open a blank doc. Three columns: Task · genai.mil handles · Ask Sage handles. Pick five task types that cover what your section actually does this month. Fill at least one specific, real example in every cell. Issues you have personally seen succeed or fail. If you cannot tell yet, write "untested" and add it to your re-test list for next quarter.
What good looks like in 15 minutes: 5 rows, every cell has at least one specific task (not a topic), and you can name at least one task where the two tools differ. If both columns say the same thing for every row, you have not stressed the boundary yet.
Module 1 · Build Your Map
11

Debrief · one issue per student, categorized live.

10 minutes · ~60 seconds each · we are listening for patterns

What to share

  • One task where one tool clearly handles it and the other clearly does not.
  • One task where you would now reach for a different tool than you did last month.
  • One issue where the failure was tool-selection, not the model itself (the meta-frontier).

How I categorize it on the wall

  • Context → fix the prompt
  • Frontier → on the unit map
  • Meta-frontier → on the unit map AND in the Workflow Playbook
  • Platform quirk → on the unit map
When you can name where AI fails in your domain, you stop trusting it there. Knowing which tool to reach for first is the new advanced skill.
Module 1 · Debrief
12
Module 2 of 6

Complex Build.
Readiness Dashboard.

60 minutes

Single-file static HTML. Three joined data sources. Hand-rolled SVG bars. Two mode switches and at least one tool switch by design.

Open instructor notes for this section ↗
Module 2 · Complex Build
13

Build target · static HTML Readiness Dashboard.

Single file. Inline CSS. Inline JS. No external scripts. No CDN.

Inputs and outputs

  • Three data sources: personnel, training, equipment (JSON or CSV)
  • Joined in JavaScript by EDIPI
  • Overall readiness percentage card
  • Hand-rolled SVG bar chart by company
  • Not-Ready table
  • Plain-text Commander's Snapshot with Copy button
  • localStorage persistence so reload does not lose state

You will switch modes 2x AND tools 1x

  • Phase 1: Centaur, Ask Sage with reasoning model (architecture)
  • Phase 2: Cyborg, your choice (ingestion + join)
  • Phase 3: Centaur, your choice (visualization)
  • Phase 4: Verification, Ask Sage cross-check + genai.mil brief generation

If you finish, the dashboard is a bonus. The grade is mode-switching AND tool-switching.

Reference build: builds/readiness-dashboard.html. The destination. Check your work, do not copy-paste from it.

Module 2 · Build Target
14

Phase 1 · Data Architecture.

15 min · Centaur · Ask Sage with a reasoning model

Why this mode

Centaur: data schema errors compound. A bad join model means rework everywhere downstream. Slow down. Verify.

Why this tool

Ask Sage: reasoning quality on architecture trade-offs is materially higher with a reasoning model than with fast chat. The right tool wins the next hour for you.

The work

  • Upload three sample CSVs to Ask Sage: personnel, training, equipment.
  • Select Claude Opus 4.7 (or whichever reasoning model the tenant exposes).
  • Whiteboard the join model with the AI as your peer reviewer.
  • Confirm three edge cases out loud before you write a line of HTML.

Checkpoint: every student names three edge cases before leaving Phase 1.

Module 2 · Phase 1
15

Phase 1 prompt · verbatim to Ask Sage.

Reasoning model selected. Three CSVs uploaded. One prompt.

Whiteboard prompt · Ask Sage · Phase 1

You are helping me design a Unit Readiness Dashboard. I have three CSV datasets (personnel, training, equipment) I am about to upload. I need you to:

  1. Propose a join model that produces a readiness percentage per Marine and aggregated by company.
  2. Identify three edge cases that will break the naive join (orphaned EDIPIs, NULL training records, unassigned equipment).
  3. Recommend the in-memory data shape after parsing. Plain JS objects, no framework, single-file HTML.

Constraint: the final tool must run 100% offline in a single HTML file with no external scripts or CDN.

Module 2 · Phase 1 Prompt
16

Phase 1 verification · three edge cases out loud.

Walk the room. No student leaves Phase 1 without naming three.

Edge cases most rooms hit

  • Orphaned EDIPIs (a Marine in personnel with no training record)
  • NULL training records (the field exists but is empty)
  • Duplicate equipment serial numbers
  • Expired training certifications
  • Marines listed in personnel but marked NJP or unfit
  • Cross-attached Marines appearing under two units

If a student is stuck

One more prompt back to Ask Sage:

Edge-case extension prompt

List five more edge cases I am not thinking of, ranked by how silently they would break the readiness percentage.

Module 2 · Phase 1 Verification
17

Phase 2 · Data Ingestion and Joining.

15 min · Cyborg · the explicit tool-choice beat

Why this mode

Cyborg: ingestion is trial-and-error. Parse errors, schema drift, edge cases revealed only by running the code. Stay in continuous conversation.

Student choice on tool

  • genai.mil chat: paste-and-parse. Fast iteration, lower setup cost.
  • Ask Sage: upload the CSVs, generate the join code with real schema knowledge. More deliberate, higher upfront cost, fewer rewrites.

Both are valid. The point is that you can articulate why you chose yours.

The teaching beat

Course 4 students switch between centaur and cyborg modes. Course 4.5 students switch BOTH modes AND tools. The two axes are independent. You can be in cyborg mode on genai.mil (fast chat, fast iteration) OR cyborg mode on Ask Sage (fewer turns, larger context per turn).

Make the framing explicit. Watch students discover the second axis.

Module 2 · Phase 2
18

Phase 2 deliberate failure · let students hit it.

Do not warn. Do not correct. Let the AI fail. Debrief frontier-or-context.

Prompt 2 · genai.mil chat

Write JavaScript that parses a JSON array of personnel records and a JSON array of training records. Join by EDIPI. Output an array of {marine, training_status} objects. Inline JS, no dependencies, single function.

What the AI almost always returns

  • An O(n^2) array.find() join that visibly stalls on 500+ records.
  • A JSON.parse call with no try/catch that crashes on the first malformed paste.
  • A join that silently drops orphans without flagging them.

Debrief: frontier or context?

Honest answer: context. The fix is to tell the AI it is getting 500+ records and ask for an indexed Map join, plus a try/catch with a visible error fallback.

Surface the lesson out loud. Most "AI failures" in this room today will be context failures, not frontier failures.

Module 2 · Phase 2 Failure
19

Phase 3 · Visualization.

20 min · Centaur · the SVG-vs-Chart.js teaching beat

Prompt 4 · first pass

Build a horizontal bar chart of readiness by company. Company on the left axis, percentage as the bar length, value label on the right. Colors: scarlet for under 75, gold for 75 to 90, ink for above 90.

What fast chat almost always returns

  • A <script src="..."> pulling Chart.js from a CDN.
  • That violates the no-external-dependencies constraint stated up front.

Let the student catch it. If they do not: "Where is that chart actually coming from? Open DevTools. What is the Network tab showing?"

The reprompt

Phase 3 forces the student to restate the constraint with more force. This is the second deliberate failure target of the build. Next slide is the verbatim reprompt.

Module 2 · Phase 3
20

Phase 3 reprompt · verbatim.

Stronger constraint. Targeted output format. Stylesheet palette referenced.

Prompt 5 · constraint refresh

No external scripts, no CDN, no Chart.js. Render as inline SVG using vanilla JS. Bars labeled with company name on the left and percentage on the right, scaled to 0 to 100. Use the scarlet/gold/ink palette already in the stylesheet.

When this prompt is run through Ask Sage with a reasoning model, the constraint is honored more reliably from the first response. When run through fast chat, the constraint gets violated and the student has to catch it. The contrast IS the lesson, and the next slide names it.

Module 2 · Phase 3 Reprompt
21

Reasoning models honor constraints. Chat models drift.

A key teaching beat for the day. Both tools have a place.

genai.mil · fast chat

  • Cheaper per turn. Faster bounces.
  • Pushes the verification burden onto you.
  • Constraints drift after many turns or when not restated.
  • Best for tasks where the verification cost is low and the iteration speed pays off.

Ask Sage · reasoning model

  • More expensive per turn. Slower bounces.
  • Honors stated constraints more rigorously.
  • Holds schema and file context across the conversation.
  • Best for architecture, multi-file refactor, and verification cross-checks where the cost of wrong is high.
Neither is wrong. Use both. Pick deliberately. That picking IS the new advanced skill.
Module 2 · Tool Contrast
22

Phase 4 · Verification and Commander's Snapshot.

10 min · Ask Sage cross-check, then genai.mil brief generation

Prompt 6 · Ask Sage verification cross-check

I am uploading two files: readiness_output.csv (what my dashboard generated) and personnel_source.csv (the original data). For each Marine in readiness_output.csv flagged as NOT READY, verify the reason by cross-referencing the source. Flag any rows where the source data does not support the NOT READY classification.

Prompt 7 · genai.mil chat · Commander's Snapshot

Add a "Generate Commander's Snapshot" button. It produces a plain-text brief in a read-only textarea. Format: header with unit and as-of date, overall readiness percentage with delta vs last week from localStorage, breakdown by company, top three NOT READY drivers, and a one-line REQUEST at the end if any company is below 75. Add a Copy button next to it.

Final standard: pick one Marine from the Not-Ready table. Open the source CSV in a text editor. Confirm the dashboard's reason matches the source.

Module 2 · Phase 4
23

Module 2 debrief.

5 minutes · mode switches, tool switches, AI failures, time saved

  • Where did you switch MODES? Why there?
  • Where did you switch TOOLS? Why there?
  • Where did the AI fail? Was it a frontier issue, a context issue, or a meta-frontier (tool-selection) issue?
  • If you had to build this again tomorrow, what would you do differently?
  • How long did this take? How long would it have taken without AI?
7+ prompts. At least 2 mode switches. At least 1 tool switch. At least 1 error-recovery cycle. That is normal for advanced builds. The students who succeeded verified at each phase boundary AND picked the right tool for each phase.
Module 2 · Debrief
24

BREAK

10 minutes · back at the time on screen
Break
25
Module 3 of 6

Group Debugging.

40 minutes

Five problems. Seven minutes each. We fix the diagnostic pattern, not necessarily the tool.

Open instructor notes for this section ↗
Module 3 · Debugging
26

Clinic Protocol · same four steps per problem.

7 minutes total per scenario · this slide stays up

Step 1 · 2 min · Student presents

Format: "Expected behavior. Actual behavior. Steps I have already taken." No story-telling. No theories. Three lines.

Step 2 · 3 min · Group diagnosis

Clarifying questions and hypotheses. I prompt: data or logic? Input or output? Pattern we have seen before? Was the right tool used for this work?

Step 3 · 2 min · Instructor synthesis

I name the root-cause category: frontier limitation, insufficient context, incorrect assumption, integration failure, data quality, OR tool-selection failure (meta-frontier). New for 4.5.

Step 4 · in chat · Document the pattern

Failure case goes on the collective Advanced Frontier Map. Pattern is the deliverable, not the fix.

Time allocation: 7 min per problem × 5 problems = 35 min. 5 min final synthesis.

Module 3 · Clinic Protocol
27

Scenario 1 · Fetch race / double-submit.

Counseling Tracker tool

Symptom

A counseling tracker tool double-saves the same entry whenever the user clicks Save twice quickly. Two identical rows in localStorage, both with the same timestamp.

Student fixes already tried

  • Added a confirm dialog
  • Added setTimeout debounce
  • Neither worked

Diagnostic pattern

Async handler race + missing in-flight guard. The save function fires twice before the first write completes. The fix is a disabled-while-pending button state plus an idempotency key.

Category: context. AI was not told "this handler can be reentered." Fixable in the prompt.

Full answer key →

Module 3 · Scenario 1
28

Scenario 2 · Stale localStorage after schema change.

TEEP-like Tracker

Symptom

Tool was updated to add a "Confirmed By" field. New entries save correctly. Old entries throw a JS error when displayed because the property does not exist on legacy rows.

Student fixes already tried

  • Rebuilt the render function three times
  • Each rebuild handled the new entries cleanly
  • Each rebuild still crashed on the old entries

Diagnostic pattern

Missing schema migration. The fix is a one-time backfill on load (default the new field to a safe value when reading legacy rows) plus a schema version number in localStorage so the next migration is forced.

Category: context. AI was not told the localStorage shape changed between versions. Fixable in the prompt.

Full answer key →

Module 3 · Scenario 2
29

Scenario 3 · CSS specificity + timezone bug.

Watch-Bill highlight rows. Two bugs stacked.

Symptom

Watch-bill tool should highlight rows for the current day in gold. The wrong row highlights, and only on Mondays.

What is actually wrong

  • The gold class is overridden by a more specific selector
  • Day-of-week comparison uses local time
  • Rendered dates use UTC
  • On Mondays the UTC date is still Sunday

Diagnostic pattern

Two independent bugs on the same symptom. Fix one and the bug looks "different" but does not go away. The discipline is to bisect: kill the specificity collision first (use class-based styling consistently), THEN address the timezone.

Category: context + frontier. The CSS specificity is context. The timezone-vs-UTC is a frontier issue date-math AI gets wrong without explicit constraints.

Full answer key →

Module 3 · Scenario 3
30

Scenario 4 · Event delegation on dynamic content.

List-based tool with Add and Delete

Symptom

Tool renders a list of items with Delete buttons. The first three Delete buttons work. After clicking Add and rendering four more items, only the original three buttons respond.

Diagnostic pattern

Click handlers were attached at render time. New items never got them. Fix is event delegation on the parent container (single listener that reads event.target) or re-attach handlers on every render.

Category: context. AI was not told the list is re-rendered after Add. Fixable in the prompt.

Full answer key →

Module 3 · Scenario 4
31

Scenario 5 · Tool-selection failure.

New for 4.5. The meta-frontier scenario.

Symptom

Student used genai.mil chat for a complex multi-file refactor of a static-stack tool. After 47 turns, the model has lost track of which files changed and is reintroducing bugs that were fixed earlier in the thread.

Diagnostic pattern

The correct move was Ask Sage with file ingestion plus a reasoning model from the start. This is not a frontier issue with the AI. It is a meta-frontier issue with the human's tool selection.

Category: meta-frontier. The fix is tool selection, not prompt refinement. Switch to Ask Sage, upload the files, restate the goal once.

Full answer key →

This category did not exist for Course 4 students. It does for Course 4.5 students, because they have two tools to pick between.
Module 3 · Scenario 5
32

Module 3 synthesis · three patterns and a diagnostic instinct.

5 minutes · before we move to Module 4

The three patterns we just saw

  • State mismatch: the AI did not know what the world looks like right now (legacy localStorage, in-flight handler, re-rendered DOM)
  • Wrong primitive: the AI reached for a primitive that does not match the constraint (inline style vs class, find vs Map, local time vs UTC)
  • Wrong tool: the AI was the wrong tool for the job in the first place (chat where a reasoning model was needed)

The two-step diagnostic instinct

  1. Is this fixable with a better prompt? If yes, it is context. Rephrase and continue.
  2. Is this fixable with a different tool? If yes, it is meta-frontier. Switch tools and continue.

Only if both are no, it is a real frontier limit and goes on the map.

Most debugging is context, platform quirks, or wrong-tool. The human can fix all three. The fourth category (frontier) is what goes on the map.
Module 3 · Synthesis
33
Module 4 of 6

Verification & QA.

30 minutes

A QA checklist, a timed planted-error drill, and the two-tool cross-check protocol that is new for 4.5.

Open instructor notes for this section ↗
Module 4 · Verification & QA
34
Workshop · Timed drill

QA Checklist · then find the five errors.

10 min silent · 10 min debrief

The five QA protocols

  1. Source verification. Every citation, regulation number, NAVMC form, and URL independently verified.
  2. Data accuracy. Numbers, dates, names, ranks, quantities checked against source.
  3. Logic check. Reasoning holds end to end. Conclusions supported by premises.
  4. Format compliance. Matches required formats, templates, and standards.
  5. Domain review. Smell test from the SME. The one AI cannot do for you.
Drill prompt

I am sharing a one-page AI-generated SOP excerpt in chat now. Run it through all five protocols. Mark every issue you would not sign. Number them.

Five planted errors: two fabricated references, one contradictory timeline, one logic error, one format break. Find all five.

If you only find three, you are skipping a protocol. Usually source verification, because it requires looking things up. Looking things up IS the protocol.

Module 4 · Timed QA Drill
35

Two-Tool Cross-Check Protocol.

New for 4.5. For high-stakes outputs, run the same prompt through both tools.

When to run it

  • Anything that goes to a CO
  • Anything that becomes a system of record
  • Anything that ends up in a fitness report
  • Any data product where wrong numbers have downstream consequences

How to run it

  1. Same prompt to genai.mil.
  2. Same prompt to Ask Sage with a reasoning model.
  3. Compare the two answers side by side.

What the result means

  • Both agree on a substantive judgment → confidence increases (not to certainty, but materially).
  • Tools disagree → that is a flag. Dig in. The disagreement is data about where the model is uncertain.
  • Systematic disagreements → frontier-map row, probably a Capability Gap Map row.

Cost: two prompts and a comparison turn. Reserve for outputs that will be acted on.

The GDPval study showed AI-assisted workflows matched expert quality on roughly half of tasks and ran 1.4x faster / 1.6x cheaper at expert parity in those scenarios. The two-tool cross-check is one of the cheapest ways to raise the floor on review quality.
Module 4 · Two-Tool Cross-Check
36
Module 5 of 6

Teach-Back.
The 201 Multiplier.

30 minutes

You leave this room as the person who teaches the next two Marines. Tool selection is part of what they will need to learn.

Open instructor notes for this section ↗
Module 5 · Teach-Back
37

The permission gap, the apprentice problem, and the junior protocol.

Mollick: workers using AI but hiding it. Entry-level postings in AI-exposed roles dropped roughly 35% from 2023 to 2025.

The permission gap

Workers already using AI but hiding it. Worried about organizational reaction. Shadow culture, no shared practice. Your role as a 4.5 graduate is to formalize use, share techniques, and train others. Including tool selection.

The apprentice problem

If a junior never manually writes a key document because AI generates it, how do they develop the judgment to know when the AI version is wrong? And how do they learn which tool to reach for when the task gets harder?

Protocol for junior Marines

  • Require review and explanation · they defend WHY AI's output is right or wrong
  • Periodically work without AI · key tasks done from scratch monthly
  • Use AI output as a teaching tool · hand them a flawed draft, have them find issues
  • Rotate them through QA · judgment from exposure to bad output
  • Teach tool selection deliberately · when a junior asks for help, ask which tool they reached for first and why. Coach the selection, not just the prompt. New for 4.5.
Module 5 · Permission Gap
38
Workshop · Teach-back

Teach one concept · 3 minutes.

2 min pick · 5 min prep · 10 min round-robin in breakouts · 5 min full-room debrief

Pick one concept

  • Centaur vs Cyborg modes
  • Frontier mapping
  • Context-building in prompts
  • Iterative refinement
  • Verification protocols
  • The Jagged Frontier
  • Layer Separation (Spec / Prototype / Production) · new
  • Tool Selection (genai.mil vs Ask Sage) · new
Template
  • One-sentence definition
  • Why it matters
  • Real example from your job
  • One common mistake
  • Key takeaway

Peers grade you on three things: could they explain it back, was the example concrete enough to be believable, do they know what to do differently tomorrow because of you.

Module 5 · Teach-Back
39
Module 6 of 6

Workflow Playbook.
Capability Gap Map.

30 minutes

One page. One recurring task. The deliverable that proves you graduated. Plus two new Capability Gap Map rows that go to leadership.

Open instructor notes for this section ↗
Module 6 · Playbook
40

Workflow Playbook template · with the new "Which tool" column.

Two worked examples. Same template. Different tool selection.

Worked example 1 · genai.mil chat

TaskWeekly training schedule for the section
Frequency / ModeThursdays by 1600 · Cyborg
Which toolgenai.mil chat
Steps (H/AI)1. H: pull events. 2. AI: draft schedule. 3. H: cross-reference range/vehicle/instructor. 4. AI: format conflicts as decision matrix. 5. H: decide, add notes. 6. AI: generate final.
VerificationLocations confirmed · 24-hour times · no double-bookings · uniform specified · POC listed
Time savings3 hours → 45 minutes

Worked example 2 · Ask Sage + Claude Opus 4.7 · new

TaskWeekly readiness rollup for the company commander's brief
Frequency / ModeMondays by 1000 · Centaur
Which toolAsk Sage with Claude Opus 4.7 (or the tenant's reasoning model)
Steps (H/AI)1. H: export three rosters from MOL. 2. AI: upload to Ask Sage, compute deltas vs last week, top three drivers. 3. H: spot-check three new NOT READY Marines against source. 4. AI: generate plain-text rollup with one-line recommended action. 5. H: log to dashboard for next week's delta.
VerificationOverall % matches manual count · delta sign and magnitude vs last week · top three drivers trace back to source · two-tool cross-check on cross-attached Marines
Time savings4 hours → 1 hour
Module 6 · Worked Examples
41
Workshop · Build now

Write your playbook. Add two Capability Gap Map rows.

20 minutes · silent work · submit in chat by the timer

Your playbook

Pick a real, recurring task. Fill the nine fields: task, frequency, mode, which tool, 4 to 8 H/AI steps, verification checklist (3 to 5 items), known frontier issues, time savings, junior development note.

Completion bar: 4+ steps with H/AI labels, verification checklist with 3+ items, at least one real frontier issue, time savings backed by your own data, an actionable junior development note.

Capability Gap Map rows (2+)

Map your playbook to production-layer wishes. Examples from the readiness rollup:

  • MOL write-back so the snapshot posts back to the system of record
  • Automated brief delivery once the rollup is approved
  • Direct connector to the training currency database so the manual export is eliminated

Open the Capability Gap Map template and fill rows now, while the workflow design is fresh.

Module 6 · Worktime
42

What you produced today.

Four artifacts. All of them outlive the session.

On the floor when you leave

  • Readiness Dashboard: single-file static HTML, three data sources joined, SVG bars, Not-Ready table, Commander's Snapshot. Public URL.
  • Advanced Frontier Map rows: at least three rows specific to what bit you today, split by tool.

Goes back to your section and up the chain

  • Workflow Playbook: one page, one workflow your section can run without you in the room. With the "Which tool" column filled.
  • Capability Gap Map entries: at least two new rows mapping production-layer wishes to specific blocking permissions. Goes to the AI POC, then HQMC.
Three deliverables to the unit. One artifact to leadership. Compound across sessions.
Wrap · Artifacts
43
Closing

Power Apps skills die
when you turn in your CAC.
HTML skills don't.

Knowing when to switch tools makes both skills last.

Artifacts dueDashboard URL, Playbook, two Gap Map rows
Assessment5 of 6 categories Meets-or-better
NextForward Week 5 invite up the chain