A live, beta-stage GenAI tool inside EY's Virtual Internal Auditor platform was failing adoption — not because the AI was wrong, but because the interface couldn't communicate what the AI could do. I led the redesign as a systems thinking exercise, not a UI cleanup.
When my team inherited GenIAus, it was already live in beta — a GenAI chat tool embedded inside EY's internal audit platform (VIA). On paper, it could answer auditor questions, analyze documents, and automate parts of the audit workflow. In practice, it was a collection of disconnected features stitched together without any design rationale.
The interface had no hierarchy. Navigation was misplaced — a global feature buried in a local sidebar. There were no suggested prompts, no onboarding, no visibility into what the system could actually do. Auditors were expected to figure it out by trial and error in a domain where mistakes carry regulatory weight.
The business problem was clear: EY had invested in building an AI tool for audit teams, but adoption was stalling because the people it was built for couldn't use it effectively. The gap wasn't in the AI's capabilities — it was in how those capabilities were surfaced, sequenced, and communicated.
The user problem ran deeper: Financial audit managers juggle hundreds of documents across multiple standards, phases, and review cycles. They needed a tool that reduced cognitive load, not one that added to it. The beta did the opposite — it required users to already understand what the AI could do before they could get any value from it.
Eight states from the engineer-built beta — annotated during contextual analysis. Yellow sticky notes mark every usability issue: misplaced navigation, missing prompts, broken scroll behavior, no system status, no chat history, no pagination, no copy. Every Nielsen heuristic violated, and the violations compounded each other. Step through each frame to see the failure mode.
GenIAus wasn't a greenfield project. It was a rescue operation with constraints on every side: a pre-built engineering-first UI, beta-stage instability, no AI design system to inherit from, a complex regulated domain with diverse users, and scalability pressure to integrate with VIA's broader ecosystem. Every design decision had to be defensible against what was already shipped to real users.
I made a deliberate choice early: this would not be a UI cleanup. The problems were structural, not cosmetic. I re-framed GenIAus from a collection of AI features into a risk-aware human–AI decision system — built on reusable interaction patterns, contextual AI behaviors, and embedded transparency.
Mapping the full auditor journey, not just the chat interface. Designing entry points that matched real task flows.
AI responses, prompts, and capabilities adapt based on what the user is doing and which audit phase they're in.
A modular architecture that accommodates new features (analytics, issue tracking) without restructuring the whole interface.
When the AI leads, when the human leads, how confidence is communicated, how user agency is maintained.
Transparency disclaimers, AI limitation disclosures, and accessibility — embedded as constraints, not afterthoughts.
In regulated domains, users won't adopt tools they can't explain to a regulator. Every design decision had to answer the question: "Could an auditor justify this to their oversight board?"
A structured 5-day design sprint moved the work from analysis to validated structure — divergent before convergent, lo-fi before hi-fi, structure before surface. The carousel below cycles through the sprint's working materials; it advances on its own, but pause to study a frame any time.
Day 1–2 · Understand & Define. Contextual analysis of the beta. Stakeholder interviews to map pain points, workflows, and expectations. Defined the core problem statement: "How might we redesign GenIAus so that its AI capabilities are intuitively understood and seamlessly integrated into the auditor's existing workflow?"
Day 3 · Ideate. Generated multiple flow variations on Miro — exploring different approaches to onboarding, file selection, prompt scaffolding, and capability communication. The key tension: how much structure to impose on a conversational interface without killing the flexibility that makes AI tools useful.
Day 4 · Decide. Evaluated iterations against three criteria: does it reduce time-to-first-value, does it scale to future features, can it be built incrementally without a full rewrite. Selected the flow that balanced all three.
Day 5 · Prototype. Built a clickable prototype in Figma — lo-fi first to validate structure, then hi-fi to test visual hierarchy and interaction patterns.
After the sprint, I led usability testing with audit professionals — the actual end users — using a moderated think-aloud protocol focused on Document Intelligence and Prompt Scaffolding. Insights were synthesized through thematic analysis, empathy mapping, and affinity mapping.
Labeling mattered more than we expected. Users consistently misunderstood feature names inherited from the engineering team. "Find in Files" didn't communicate what the feature actually did. Renaming to "Document Intelligence" with clear sub-descriptions immediately improved comprehension.
Users needed to see AI capabilities before they could use them. The empty chat state was a dead end. Adding structured entry points — Ask me anything, Document Intelligence, Q&A in Issue Tracker, Q&A in Analytics — gave users a mental model of the system's scope before they typed a single prompt.
File selection needed progressive disclosure. Showing 80+ audit files in a flat list overwhelmed users. Introducing an audit name dropdown first, then revealing the filtered file list with pagination and search, dramatically reduced decision fatigue.
Trust required transparency. Users in a regulated domain needed to know the AI's limitations upfront. Adding the disclaimer — that insights should serve as reference points, not definitive answers — wasn't just legal compliance. It was a trust-building mechanism that made users more willing to engage with the tool.
Each redesigned feature was a deliberate response to a specific insight — the problem, the design decision, and the reasoning behind it.
The beta dropped users into an empty chat with no guidance. Users with low AI literacy didn't know where to start. Users with high AI literacy didn't know what this specific AI could do.
A landing state that doubles as onboarding and navigation. Four capability entry points — Ask me anything, Document Intelligence, Q&A in Issue Tracker, Q&A in Analytics — each with a clear icon and label. Below: a transparency disclaimer that sets expectations about AI limitations.
It answers the three questions every new user has — what is this, what can it do, what should I do first — without requiring them to read documentation or watch a tutorial.
Users didn't know how to prompt the AI effectively. Enterprise users aren't ChatGPT power users — they needed guidance on what to ask and how to ask it.
Persistent contextual prompt suggestions surfaced at the bottom of the chat — tailored to the active feature mode and updated based on context. Tap a suggestion to start, then refine.
It transforms the chat from a blank text field into a guided interaction. The barrier to first interaction drops from "compose a prompt from scratch" to "select and customize."
The beta had no memory. Every session started fresh. Auditors working across multi-week engagements couldn't reference previous AI interactions, forcing them to repeat queries and lose context.
A collapsible sidebar showing recent chats (last 10 conversations) with preview text and the ability to resume any previous thread. New Chat button prominently placed for starting fresh.
It gives auditors the continuity they expect from any modern tool — and builds confidence. Users can see that their work persists, which increases willingness to invest time in detailed queries.
The beta's file selection was a flat, unsearchable list of every file in the audit. No filtering, no pagination, no feedback on selection.
A two-tab approach — "Use existing files" and "Upload a new file" — within a dedicated Document Intelligence section. Existing files use progressive disclosure (audit name first, then paginated list). Upload supports drag-and-drop with clear constraints. Step through the carousel to see all three states.
It separates two fundamentally different user intents (work with what's already in the system vs. bring in something new) and optimizes each path independently.
The beta showed all files in a single scrolling list with no way to filter, search, or paginate. Users doing focused analysis had to visually scan every row.
Redesigned tables with search, pagination, column headers (Name, Location, Source, Action), and inline actions (Generate Summary). A "Continue to Q&A" button provides a clear exit point once files are selected.
It respects the user's time. An auditor reviewing a specific standards-mapping document shouldn't need to scroll past 60 irrelevant files to find it.
The beta gave zero feedback during processing. Users clicked "Proceed" and stared at an unchanged screen, unsure if the system was working, broken, or finished.
A full-screen loading state with progress indicator and clear messaging. Background content visually dimmed to prevent interaction during processing. Toast notification confirms completion.
It's Nielsen's #1 heuristic for a reason. In an enterprise context where processing can take 30+ seconds, silence feels like failure. Visible progress maintains user trust and prevents duplicate submissions.
Different stakeholders — audit managers, compliance officers, CROs, IT risk specialists — need different types of analysis from the same data. A one-size-fits-all AI response style served no one well.
A persona modal accessible from the chat. Four personas with clear focus-area descriptions; selection changes the AI's response style and analytical lens mid-conversation. Persona prefix ("AM:") confirms which lens is active.
It's a UX solution to a prompt engineering problem. Instead of expecting users to frame queries differently for different perspectives, the interface handles the context-switching.
Even with prompt scaffolding, advanced users wanted access to proven, reusable prompts. The organization wanted to standardize how teams interacted with the AI to ensure consistency.
A searchable Prompt Collection modal with two tabs: "My Prompts" (personal, 7/25 used) and "System Prompts" (organization-wide, 25 available). Domain-specific, ready to run.
It bridges individual productivity and organizational standardization. Users get a head start on complex queries; the org gets consistency in how AI-assisted analysis is conducted across teams.
Redesigning a live product with 8 developers required a different kind of collaboration. Here's what worked.
Designing within technical reality. I sat with the engineering team to understand API constraints, processing limitations, and backend architecture before proposing interface changes. Every design decision was pressure-tested against what was actually buildable within the current sprint cycle.
Shared problem framing. Instead of presenting solutions, I brought the usability findings to engineering as shared problems. "Users can't find their files" is a problem everyone owns. "Add pagination to the file table" is a prescription only design owns. The former gets buy-in; the latter gets resistance.
Incremental refactoring strategy. We couldn't rebuild everything at once. I prioritized changes by impact-to-effort ratio: prompt scaffolding and onboarding shipped in Sprint 1 with minimal backend changes; Document Intelligence restructuring required API changes and shipped in Sprint 2; persona selection needed backend persona configuration and shipped in Sprint 3.
Design artifacts as alignment tools. The Miro boards, annotated screenshots, and lo-fi wireframes weren't just design process — they were communication tools. Engineers could see the rationale behind every decision, which reduced rework and misinterpretation during implementation.
Beyond the numbers: GenIAus moved from a beta experiment that audit teams avoided to a tool they actively requested access to. The design patterns established in this project — prompt scaffolding, persona-based AI interactions, progressive file disclosure — became the foundation for EY's broader AI product design standards.
More upfront co-design with auditors. The 5-day sprint was effective for speed, but I'd have benefited from embedding with an audit team for a full engagement cycle before designing. Some workflow nuances only surface when you watch someone do the actual work over days, not hours.
Earlier investment in a design system. I defined patterns as I went, but a dedicated sprint for establishing the AI interaction design system upfront would have reduced inconsistencies and accelerated later feature design.
Push harder on analytics integration. The Q&A in Analytics feature was scoped but not fully designed in my tenure. If I were continuing, that's where I'd focus — audit analytics is where the highest-value AI use cases live.
Transparency and guidance aren't nice-to-haves in AI products — they're structural requirements. In regulated domains, users won't adopt tools they can't explain to a regulator. Every design decision had to answer the question: "Could an auditor justify this to their oversight board?"
Context retention improves efficiency. Conversation history, file selection persistence, and persona memory aren't features — they're respect for the user's time.
The best AI interface design is invisible. When GenIAus works well, auditors don't think about the interface. They think about the audit. That's the goal.
I'm open to senior product design roles, advisory work, and selective collaborations. Whether you have a defined brief or a fuzzy problem space, let's talk it through.
damleaalvee@gmail.com