Expert Interviews
Talked to UX, engineering, Driver Distraction, and HMI experts at Mercedes-Benz to understand their thoughts and pain points around current adaptive UI work. Findings were grouped using thematic clustering.
In-car software today is built piece by piece, one use case at a time, slow to change and expensive to maintain. Generative UI replaces that with a single system: one library, many outputs.
Custom design and development for every use case.
Built at runtime from ready-made components. Same day.
Grows with each new use case. Every variant adds work to maintain.
One library covers all cases. Adding a new one costs almost nothing.
A few fixed personas. Set once at delivery, never updated.
Adapts to each driver and moment. Improves over time.
Each market is built separately. Versions drift apart.
One model, many surfaces. Variants are generated, not built.
Often missed or left out, too expensive to design for individually.
Covered automatically by the same system.
An AI-native framework that assembles infotainment UIs in real time from pre-validated components, guided by context and bounded by safety.
Pick a driver. Hit Run. The car streams its sensor signals into the model, the model thinks out loud, and the cabin screen assembles itself from atomic components, each pulled from a validated library and placed in response to specific signals.
The same machinery handles all three. The combinations of signals produce genuinely different UIs. That's the point.
A research-through-design (RtD) approach grounded in human-centered principles, moving from problem framing, through prototyping and benchmarking, into a controlled in-car evaluation with 30 drivers.
Talked to UX, engineering, Driver Distraction, and HMI experts at Mercedes-Benz to understand their thoughts and pain points around current adaptive UI work. Findings were grouped using thematic clustering.
Evaluated different LLMs on latency and ability to generate / adapt UI from a fixed component library. Each model received the same 8 atomic UI elements and was asked to generate / adapt the interface. We measured latency, from voice prompt to rendered UI. Gemini 2.5 Pro hit the best balance between adaptivity and latency for the use case.
Conceptualised and finalised the architecture for the refined system. The LLM produces structured JSON; a deterministic renderer maps that to safe, brand-compliant components.
Individual evaluation sessions with four designers using a workbook and a fixed task. Issues were rated 0–4 on severity. Issues rated 2, 3, and 4 were resolved before the user study.

AI-based system was compared against the pre-designed baseline using an A/B study. Both were integrated into the car's infotainment unit. Participants were split into four groups with randomised condition order to minimise ordering effects.
Quantitative analysis of the study data combined with qualitative insights to produce actionable design recommendations for constrained generative interfaces in safety-critical contexts.
The LLM is the gravity well. A bounded library, live context, and a constitutional system prompt all flow in. JSON flows out, gets validated, and only then becomes UI. Click any block for detail.
The LLM doesn't draw from scratch. It picks from this fixed set of brand-compliant, safety-validated components and fills them with contextual content. The structure of each element is locked; the AI controls which ones appear, where they go, and what data they carry.















Both systems were integrated into the same car's infotainment unit. Participants experienced both, in randomised order, to minimise ordering effects.

The AI-based UI performs as well as or better than the pre-designed GUI in usability (H1a), clarity (H1b), and informational value (H1c).
The AI-based UI causes equal or lower cognitive load (H2a) and distraction (H2b), and equal or higher situational awareness (H2c) than the pre-designed UI.
Participants were placed inside a continuous travel narrative: an early-morning drive from Böblingen to Munich for the IAA Mobility event. Within that drive, the session passed through three events designed to test three different interaction modes.
Across every measured variable, no statistically significant difference was found between the AI-assembled and pre-designed systems. Most participants didn't even notice a difference.
High variance was observed when the AI was given the most freedom in component creation. The interface accommodated the necessary information but not always with consistent structure across generations. How that affects users in the long term is an open question for future research.

AI showed strong capabilities in assembling and generating UI on the fly. But high variation in the output can compromise trust and safety. The pragmatic answer is hybrid: designers retain control of critical components, while AI handles non-critical, supplementary information that benefits from being personalised.
The work spanned design, concept development, and implementation. I combined UX research and design with system architecture and development to turn the idea into a functioning, in-vehicle prototype evaluated with real drivers.