Enhancing Onboarding: A Benchmark Study of New User Experiences

Enhancing Onboarding: A Benchmark Study of New User Experiences

Context

This was my second project at IBM. I collaborated with cross-functional partners and conducted baseline benchmarking to test a guided tour in Modeler Flows, a drag & drop Machine Learning (ML) tool.

Due to the nature of this project, I have omitted detailed information and am focusing on the process.

Role

Lead Reseacher

Toolbox

Research, Surveying, Usability Testing

Timeline

December 2021 - January 2022

Role

Lead Reseacher

Toolbox

Research, Surveying, Usability Testing

Timeline

December 2021 - January 2022

Role

Lead Reseacher

Toolbox

Research, Surveying, Usability Testing

Timeline

December 2021 - January 2022

Previous research found that it was hard learn Modeler Flows.

Recommendations from that research included:

  • Reducing number of concepts required for users to learn

  • Exposing more capabilities by default

  • Streamlining what information is provided and when

The Modeler Design team built a Guided Tour that introduced first-time users to Modeler by guiding them through the process of building a classification model.

Since the request for research came from the design team, I started by doing stakeholder interviews to learn more about Modeler and the Guided Tour.

After conducting brainstorming sessions with Design and PM, I led an official Research Kick-Off presentation that outlined the reasoning behind a benchmark study, the research plan, outstanding questions, and next steps.


Slides from research kick-off presentation that established research objectives and guidelines.

After working through multiple iterations of a research plan, I established the following to guide the research:

Objectives

  • Test Guided Tour to determine how effective it is at guiding users to build a classification model.

  • Establish baseline UX metrics to measure against in future iterations.

Methodology

  • Benchmark Study - Baseline

  • Moderated, between-groups tests: Modeler without Guided Tour & Modeler with Guided Tour

  • Collect performance measures via task-based testing with follow-up questions

  • Collect participant perceptions via surveys

Target Persona

  • Data Scientist with no prior experience with IBM Watson Studio

Number of Participants

  • Total: 16 (8 - Guided Tour & 8 - Without Guided Tour)

Behavioral Metrics

  • Time on Task

  • Task completion

  • Errors

  • Assists

Attitudinal Metrics

  • Single-ease questions (SEQ) - Pre & Post

  • System usability scale (SUS)

  • Net promoter score (NPS)

After working through multiple iterations of a research plan, I established the following to guide the research:

Objectives
  • Test Guided Tour to determine how effective it is at guiding users to build a classification model.

  • Establish baseline UX metrics to measure against in future iterations.

Methodology
  • Benchmark Study - Baseline

  • Moderated, between-groups tests: Modeler without Guided Tour & Modeler with Guided Tour

  • Collect performance measures via task-based testing with follow-up questions

  • Collect participant perceptions via surveys

Target Persona
  • Data Scientist with no prior experience with IBM Watson Studio

Number of Participants

  • Total: 16 (8 - Guided Tour & 8 - Without Guided Tour)

Behavioral Metrics

  • Time on Task

  • Task completion

  • Errors

  • Assists

Attitudinal Metrics
  • Single-ease questions (SEQ) - Pre & Post

  • System usability scale (SUS)

  • Net promoter score (NPS)

I created a screener targeting data scientists and posted it to 2 recruiting platforms.

Although I was targeting data scientists, I approved participants who had different job titles because they marked the necessary data science skills.

The moderated tests asked participants to complete 2 tasks, a survey, and answer post-task questions.

Procedure

  • 2 Tasks

  • Survey

  • Post-task Questions

Procedure
  • 2 Tasks
  • Survey

  • Post-task Questions

Task 1: Create a New Modeler Flow

Task 2: Build a classification model to predict which drug to give patients

I spent 2 weeks reviewing the notes, re-watching the recordings, and creating insights.

Initially, I spent lot of time rewatching recordings and trying to match up the notes to what I was seeing. In addition to filling in the gaps of the notes, I was affinitizing the data in Mural.

Virtual whiteboarding to answer research objectives

Because I structured the benchmark study to capture quantitative data, in addition to qualitative, I needed to figure out way to handle to metrics data. I created a notetaking template to guide my documentation of each participants user path as they worked to complete the tasks.

Note-taking template I created that outlined the Golden Paths to complete Tasks 1 & 2.

Participants, overall had an easier time completing Task 2, building a classification model to predict which drug to give patients, when they had the support of the Guided Tour.

Time on Task

For Task 2, that the Guided Tour helped first-time users build classification models faster than without the tour.

Task Completion

None of the participants in the "Without Tour” group were able to complete Task 2 without assistance, while 25% of the "With Tour" group completed the task within time and unassisted.

Errors

For Task 2, the "Without Tour” group had an average of 2 errors while "With Tour" group, on average, had 6 errors, which were errors specific to the tour.

Assists

The "Without Tour” group required assistance, on average, 2 times for Task 2, and the "With Tour" group required less than 1 assist.

Although the Guided Tour helped participants complete the tasks, there were still opportunities to improve the Guided Tour for new users.

I placed the issues within the tour in an issue severity matrix and created 5 themes to help communicate the findings and tell a compelling story. I learned that the workflow was a violation of user's mental model and modeler created map shock. The guided tour's UI was inconsistent and only provided a surface level introduction to Modeler Flow. Finally, new users perceived Modeler to be for beginners, when it is really intended for users of all skill levels.

5 Issue Themes

I presented the findings and recommendations to my cross-functional partners.

First-time use improvements

  • Consider looking at the workflow of other IBM products

  • Work with content designers to craft more user-friendly descriptions for Type node

Update Guided Tour UI

  • Update Guided Tour UI to consistently use Next or X buttons on all tour pop-ups

  • Update Guided Tour panel UI to number steps in tour

  • Update tour options and complexity

Multiple tours based on experience level

  • After tour: Provide documentation & extra resources

  • Consider end-to-end integrated tour across Watson Studio

The recommendations were implemented, and the tour was updated to address the UI issues.

I ran a follow up unmoderated A/B usability study to solve the problems around type node understanding.

Behavioral Note-taking Template

The most tedious part of this research was note-taking and mapping every user’s exact path to complete the task. I wish I had created the note-taking template sooner in the process, but it was all part of my journey to completing my first official user research project.

Qualitative Research

I spent a lot of time with the qualitative data, even though I decided not share it with my cross-functional partners. It was challenging to try to learn R Studio, but I had a lot of fun.

Contact

Have questions? Get in touch!

Contact

Have questions? Get in touch!

Contact

Have questions? Get in touch!

@ 2025 Designed & Created by Lesedi Khabele-Stevens using Figma & Framer

@ 2025 Designed & Created by Lesedi Khabele-Stevens using Figma & Framer

@ 2025 Designed & Created by Lesedi Khabele-Stevens using Figma & Framer