Design Thinking Research 2016
Design Thinking Research 2016
Hasso Plattner
Christoph Meinel
Larry Leifer Editors
Design Thinking
Research
Making Design Thinking Foundational
Understanding Innovation
Series Editors
Christoph Meinel
Larry Leifer
More information about this series at
http://www.springer.com/series/8802
Hasso Plattner • Christoph Meinel • Larry Leifer
Editors
Larry Leifer
Center for Design Research (CDR)
Stanford University
Stanford, CA
USA
It is now more than 10 years ago that the Stanford d.school was officially founded. It
did not need much persuasion for me to engage in this unique endeavor, as I was
instantly electrified by David Kelly’s vision of a hub for innovators on Stanford
campus where students and faculty from all departments come together. Looking
back, thousands of students with diverse backgrounds at Stanford as well as at the d.
school’s sister institute at HPI Potsdam have learned and experienced how to tackle
wicked problems and complex challenges to come up with innovative, human-
centered solutions.
    Ever since I came into contact with design thinking, I have been convinced of its
innovative power. I have seen design thinking bring about countless examples of
unexpected solutions, changes in working cultures, and improvements in team
performance. Something, traditional approaches usually simply cannot deliver.
They often fall short when it comes to complex and wicked problems which require
new approaches and creativity—a skill that is often assumed to be the domain of
outstanding individuals. But: Creativity is not just a skill of the chosen few; big
ideas do not merely happen; the creative spark does not just ignite on command.
That’s not how we can find solutions—especially not in a world as complex as ours.
Rather, creativity is a process that everyone can implement. And that’s what design
thinking is all about: it’s a framework and method that fosters creative confidence
which is foundational for innovation and can ultimately enable everyone to be
innovative.
    Innovation requires curiosity and an open mind. Design thinking is thinking in
terms of opportunities, not restrictions or prohibitions. It is a holistic approach and
encourages thinking across boundaries, thereby enabling real and fundamental
innovations. Once in contact with design thinking, people experience a sustainable
shift in their mindset and how they act and think. There is also a shift in the way
people approach challenges. The innovation method design thinking views prob-
lems from a human perspective, with the objective of designing innovative prod-
ucts, services, or experiences that are technically feasible, economically viable, and
desirable for the target group.
                                                                                     v
vi                                                                            Preface
                                                                                                            vii
viii                                                                                              Contents
The idea that design thinking is foundational for engineering and, therefore, in an
engineering education, is derived from the following five questions. These propo-
sitions need to be taken together and in context because in all things human centric
(business, design, society, . . .) the knowledge worth having is and must be context
dependent. This point of view balances the equation with physics and math,
whereby the knowledge worth having must be context independent.
(1) How Might We Address and Measure the Needs of Society?
While engineering has been described as the application of science and mathemat-
ics to the needs of society, up until now we have known and taught our students very
little about finding and understanding the needs of society. Thus we have indirectly
turned out half-built engineers. These were engineers who could only serve the
explicit needs of others without a direct understanding of who had the need and
why. With the advent of the design thinking paradigm, this situation has changed.
L. Leifer
Stanford Center for Design Research, Stanford University, Panama Mall 424, Stanford
94305-2232, CA, USA
e-mail: leifer@cdr.stanford.edu
C. Meinel (*)
Hasso Plattner Institute for Software Systems Engineering, Campus Griebnitzsee, Potsdam,
Germany
e-mail: meinel@hpi.de
Engineers are now taught how to engage with society through empathy training,
coping with multiple points of view, actively managing teamwork, and realizing the
full potential of product and service prototyping (experimenting). In effect, a
critical link has been made between engineering analysis, the science-focused
part of the discipline, which is context independent, and engineering design, the
human society-focused aspect of the discipline, which is context dependent.
(2) Why Must We Accept That Human Needs Are Context Dependent?
Human behavior is overwhelmingly context dependent (Bandura 1986). This
makes the formulation of problems and their solutions difficult.
   As engineering faculty and technical managers, most of us did not teach engi-
neering design nor did we attempt to manage design thinking because we either did
not understand it and/or because it includes unpredictable human behavior within
the system boundary. This lack of understanding and perhaps fear of uncertainty
and ambiguity often leads to skepticism and even contempt for the human side of
engineering—design.
(3) Are We Ready and Do We Know Enough to Change?
The cumulative work of a global design thinking research community demonstrates
our ability to instrument and quantify design behavior. We can measure its impact
on corporate team performance. We have started to understand the underlying
principles. Though valuable insights have been gained and methods and tools
further developed, we are just at the beginning. Biology, too, started as an applied
science with scientists such as Darwin gathering knowledge by doing hands-on
work. Design is now positioned to follow biology as a foundational science.
    However, its application is still, and will always be, the key to a better under-
standing of design thinking. That’s why researchers are striving to gauge and assess
its impact. One such an attempt is the recently published study report “Parts
Without a Whole? - The Current State of Design Thinking Practice in Organiza-
tions” by the project team “Impact by Design Thinking” (Eva K€oppen, Holger
Rhinow, Jan Schmiedgen, and Christoph Meinel). This study is based on survey
results. The researchers not only explored the different forms of design thinking
adoption, but also share their valuable insights derived from years of profound
research on design thinking in organizations (Schmiedgen et al. 2015).
    The benefits of their work can be seen at thisisdesignthinking.net. This website
launched by the research team showcases interesting stories from companies
working with design thinking and publishes interviews with experts and practi-
tioners. By drawing a colorful picture of the manifold design thinking activities
going on today, the website serves the research community as well as coaches,
practitioners and students.
    Many more projects pave the way for making design a foundational science.
(4) Why Is This Breakthrough Happening Now?
While design thinking is practiced universally in varying degrees, it is the unique
combination of engineering (especially IT), economics, anthropology, psychology,
neuroscience and design research that is making it foundational (Mabogunje
et al. 2015). Breakthroughs at Stanford University cannot be separated from the
Manifesto: Design Thinking Becomes Foundational                                                    3
              instrumen ng
                                                                     External
              context is                                             environment
              founda onal
                                Process
                              Instrumenta on
                                               Human Variables
                               Product
                              Instrumenta on
                                               Technical Variables
                               Context
                              Instrumenta on   Context Variables
Fig. 1 We build upon advances in design behavior research and corresponding brain research. We
quantify action and outcomes. We have a mature measure of predictive power. As a profession,
design thinking is foundational and can be understood scientifically
   Now that we have the roots of the scientific comprehension of design thinking
we can expect to continuously improve our understanding of ourselves. We can
additionally discover new practices and disseminate these practices through publi-
cations, simulations, and emulations. This understanding, which derives from the
study of human-human interaction with IT augmentation is embodied in the nature
of language—verbal and gestural. It is foundational (Fig. 1).
References
Bandura A (1986) Social foundations of thought and action: a social cognitive theory. Prentice
   Hall, Englewood Cliffs, NJ
Donald M (1991) Origins of the modern mind: three stages in the evolution of culture and
   cognition. Harvard University Press, Cambridge, MA
Mabogunje A, Sonalkar N, Leifer L (2015) Design thinking: a new foundational science for
   engineering. In: To appear in Proceedings of the Mudd design workshop, Harvey Mudd
   College, Claremont, California, 28–30 May 2015
Schmiedgen J, Rhinow H, K€ oppen E, Meinel C (2015) Parts without a whole?—the current state of
   design thinking practice in organizations, Technischer report. Study report, Hasso-Plattner-
   Institute, Potsdam
Introduction: The HPI-Stanford Design
Thinking Research Program
Since 2008 scientists from the Hasso Plattner Institute for Software Systems
Engineering in Potsdam, Germany, and from Stanford University, USA, have
engaged in the joint Design Thinking Research Program, financed and supported
by the Hasso Plattner Foundation.
Divided into four parts, this book compiles the results of the research projects of the
6th program year, covering multifaceted features and aspects of design thinking.
   Team collaboration is the basis of design thinking and thus of significant
importance in the program’s research agenda. Different “Tools and Techniques
for Improved Team Interaction” are presented in the first part of this book. These
tools and techniques, which have been investigated and developed in the program,
range from the Tele-Board for remote collaboration, to IDN for visual diagnostics,
to Google glasses in health applications. A technique for physical interaction and a
discussion system to leverage geographic diversity in massive online classes are
also introduced.
   The chapters in Part II all focus on “Creativity and Creative Confidence”—
central factors when engaging in design processes. The research projects range
from neurological studies on sustainable creativity trainings to the influence of
spatial factors on creativity and how creative confidence can be augmented with the
help of prototyping tools.
   A question that is often asked by people who engage in design thinking, or who
intend to do so, is: what is the actual impact of design thinking? Part III “Measuring
Design Thinking” contains a study giving insight into how the impact of design
Introduction: The HPI-Stanford Design Thinking Research Program                     7
Remote collaboration processes require digital tools supporting work over dis-
tances. Transferring physical artifacts to the digital world and facilitating their
flexible usage are common cases in remote settings. In “Globalized Design Think-
ing: Bridging the Gap Between Analog and Digital for Browser-Based Remote
Collaboration” Matthias Wenzel, Lutz Gericke, Christoph Thiele and Christoph
Meinel present new developments that they have integrated into their remote
collaboration software system Tele-Board to support such use cases. The authors
describe a software tool for automatically digitizing analog whiteboard artifacts
that can then be used in remote settings on a shared virtual whiteboard surface via
Tele-Board or a web browser-based virtual whiteboard application. This tool allows
shared real-time collaboration and, by closing the media gap, lowers the hurdle of
switching from analog co-located to digital remote working modes.
   The tools currently available for effective team coaching are limited to heuristics
derived from either experienced design thinking professionals or clinical psychol-
ogy practitioners. In “Visual Diagnostics for Design Thinking Teams” Neeraj
Sonalkar, Ade Mabogunje, Gina Pai, Aparna Krishnan, and Bernard Roth aim to
improve this current situation by providing design thinking managers, coaches and
instructors a scientifically validated tool for augmenting design team performance.
They present the development of a software tool called the IDN Tool, based on the
Interaction Dynamics Notation. This tool analyzes team interactions and diagnoses
patterns of behavior that influence design outcomes.
   In “Design Thinking Health: Telepresence for Remote Teams with Mobile
Augmented Reality” Lauren Aquino Shluzas, Gabriel Aldaz, and Larry Leifer
examine the capabilities and boundaries of a hands-free mobile augmented reality
(AR) system for distributed healthcare. They use a developer version of the Google
Glass™ head-mounted display to develop software applications to enable remote
connectivity in the healthcare field. With this technology, the team characterizes
system usage, data integration, and data visualization capabilities. Further, they
conduct a series of pilot studies involving medical scenarios. This chapter discusses
8                                                                        C. Koch et al.
the need for a AR head-mounted display to improve chronic wound care photog-
raphy and to facilitate surgical interventions. The authors provide an overview of
the system architecture used in this research, and highlight future applications of
AR systems for improved clinical care.
   Massive online classes are global and diverse. How can we harness this diversity
to improve engagement and learning? Currently, despite high enrollment, the
interaction between students is minimal. In fact, most students participate without
reciprocal action with others. This isolation is particularly disappointing given that
a global community is a main draw of online classes. “Talkabout: Making
Distance Matter with Small Groups in Massive Classes” by Chinmay Kulkarni,
Julia Cambre, Yasmine Kotturi, Michael S. Bernstein, and Scott Klemmer illus-
trates the potential of leveraging geographic diversity in massive online classes.
The authors connect students from around the world through small-group video
discussions. In this work, the authors challenge the view that online classes are
useful only when in-person classes are unavailable. Instead, they describe how
diverse online classrooms can create benefits that are largely unavailable in a
traditional classroom.
   Over the past 2 years, David Sirkin, Brian Mok, Stephen Yang, Rohan
Maheshwari, and Wendy Ju have followed an improvisational approach to physical
interaction in design research. This approach emphasizes the use of exploratory lab
and field experiments as a way to (a) challenge our assumptions about how people
might interact with expressive objects such as robots and active spaces, (b) appraise
the performance of our prototypes of these technologies, and (c) build frameworks
to understand users’ mental models and develop new insights into interaction. They
have focused, in particular, on creating environments—whether in public settings
or recreated in their workspace—where they can observe people’s natural reactions
to subtle changes in their routine, or to more explicit provocations designed to
understand their deeper thought processes. “Improving Design Thinking
Through Collaborative Improvisation—How We Use Naturalistic Environ-
ments to Test Design Ideas” describes how the team designs and runs experiments
to evaluate how people interact with robots built from everyday objects.
Grace Hawthorne, Manish Saggar, Eve-Marie Quintin, Nick Bott, Eliza Keinitz,
Ning Liu, Yin-Hsuan Chien, Daniel Hong, Adam Royalty, and Allan L. Reiss
contribute “Designing a Creativity Assessment Tool for the 21st century: Pre-
liminary Results and Insights from Developing a Design-Thinking Based
Assessment of Creative Capacity”. In order to assess a person’s creative capacity
in real-world situations, they propose a novel Design Thinking Creativity Test
(DTCT). The DTCT is based on the design thinking principles and can serve as
an assessment to reflect problem-solving needs of the twenty-first century. In
particular, the DTCT emphasizes assessment of case-based skills to directly
Introduction: The HPI-Stanford Design Thinking Research Program                     9
“Measuring the Impact of Design Thinking” by Jan Schmiedgen, Lea Spille, Eva
Köppen, Holger Rhinow, and Christoph Meinel focuses on how organizations
measure the impact of design thinking. This article is based on a quantitative survey
that was supplemented with qualitative interviews with experienced design
thinkers. Even though the majority of respondents perceive that design thinking
has an impact of some kind, only a small number of them attempt to measure it in
some form. Those who do not measure the impact of design thinking often do not
10                                                                      C. Koch et al.
know how or lack the resources to do so. The metrics of those who do measure the
impact vary considerably, but customer feedback and satisfaction is a recurring
theme. The authors propose that traditional means of performance measurements
are often ill-suited for evaluating the impact of design thinking. They conclude with
a promising industry example of how traditional measures are used to gauge overall
performance and a story-based approach to capture the role of design thinking.
   The creative behaviors that underpin design thinking are also difficult to mea-
sure. This is problematic because people who have a desire to practice design
thinking in an organizational context are often assessed solely on their ability to
perform via traditional metrics. Therefore they have less incentive to work in a
creative way that crosses the boundaries of these traditional metrics. In order for
organizations to fully support and incentivize design thinking, they must measure
creative behaviors as much as they do execution behaviors. The chapter “Devel-
oping Design Thinking Metrics as a Driver of Creative Innovation” by Adam
Royalty and Bernard Roth highlights a suite of initial metrics that have arisen from
research on d.school alumni and organizations who apply design thinking as a core
driver of their innovation strategy.
Design teams have different problems and needs during their projects. For the
teams’ coaches this can mean a high workload due to familiarizing themselves
with new topics and preparing different coaching sessions for the teams. On the
other hand, past teams might have experienced similar problems, and the experi-
ences and solutions of members from such teams could be valuable for current
teams. In “Experience and Knowledge Transfer through Special Topic
Coaching Sessions,” Franziska Häger, Thomas Kowark, and Matthias Uflacker
present the concept of special topic coaches that allows former members of design
teams to provide additional coaching sessions to current teams. This concept makes
it possible for experienced members of design teams to share their knowledge and
experiences with a current team by preparing and running a coaching session. The
researchers implemented the concept with the help of a coaching seminar parallel to
one of their design engineering courses. They present their evaluation results and
the changes they adapt to the next iteration of the seminar as well as their ideas of
how to adopt this concept in a company context.
    Documentation is a field of active research for the community of both design
thinkers and medical practitioners. One major challenge is to combine the benefi-
cial features of analogue and digital documentation. Since documentation needs are
quite similar in the fields of design thinking and behavior psychotherapy, an intense
collaboration has emerged between these disciplines. The design thinking tool
Tele-Board has been adapted for documentation purposes in behavior
Introduction: The HPI-Stanford Design Thinking Research Program                    11
psychotherapy and yielded a first medical application of the new tool Tele-Board
MED. In the course of tool adaptation, additional features have been developed
such as an automatic protocol function. These new features are not only useful for
therapists but beneficial for design thinkers as well. In “Smart Documentation
with Tele-Board MED” Julia von Thienen, Anja Perlich, Johannes Eschrig, and
Christoph Meinel explain why collaborative work on documentation tools is par-
ticularly promising at the intersection of design thinking and behavior
psychotherapy.
   Programming stands in the focus of the chapter “Preserving Access to Previous
System States in the Lively Kernel” by Lauritz Thamsen, Bastian Steinert, and
Robert Hirschfeld. In programming systems such as the Lively Kernel, program-
mers construct applications from objects. Dedicated tools make it possible to
manipulate the state and behavior of objects at runtime. However, when program-
mers make mistakes in such programming systems, they need to undo the effects of
their actions. Yet, recovering previous states is often error-prone and time-
consuming. This report presents an approach to object versioning for systems
such as the Lively Kernel. Access to previous versions of objects is preserved
using version-aware references. These references can be resolved to multiple
versions of objects and, thereby, allow reestablishing preserved states of the system.
The authors present a design based on proxies and an implementation in JavaScript.
   Companies implement innovation processes or outsource them to external con-
sulting companies to gain a competitive business advantage. However, as innova-
tors and engineers are seldom the same people, it is inevitable that innovation
projects will be documented prior to a subsequent information handover. In prac-
tical application this information handover seldom goes smoothly due to missing,
incomplete, or non-traceable documentation of innovation projects. The situation
becomes even worse when important design rationales, design paths and design
alternatives become no longer retrievable. The missing information may lead to an
innovation that was not intended by the innovators. The retrieval of design paths,
design rationales, and design alternatives requires high manual effort—if it is even
possible at all. In “Connecting Designing and Engineering Activities III”
Thomas Beyhl and Holger Giese present a recovery approach that eases the
retrieval of design artifacts by recovering design paths before the actual retrieval
happens. In order to do this the authors employ a recovery approach using recovery
modules that implement knowledge extraction procedures and recovery algorithms.
The authors evaluate their recovery approach using inventory documentation col-
lected in educational design thinking settings.
3 Summary
models and conducted studies and thereby laid the groundwork to make design
thinking foundational.
   We thank all authors for sharing their research results in this publication. Our
special thanks go to Dr. Sharon Nemeth for her constant support in reviewing the
contributions. We would be delighted to get in contact with our readers for further
discussion and an exchange of ideas. We invite you to visit our websites. At www.
hpi.de/dtrp you will find the latest information on past and present research
conducted within our program. Learn more about all projects and the researchers
behind them. Beyond that, the Electronic Colloquium on Design Thinking Research
(ECDTR, http://ecdtr.hpi.de) is an ideal forum for the rapid and widespread
exchange of ideas, methods, and results in design thinking research. The purpose
of this forum is to use electronic media for scientific communication and discus-
sions in the design thinking research community. The ECDTR welcomes papers,
short articles and surveys.
   Last but not least, the new website thisisdesignthinking.net offers an easily
accessible overview of current developments in design thinking. This pool of
examples and interviews, enriched with scientific explanations, helps localize all
existing expressions of design thinking, including their advantages and downsides.
Here, practitioners searching for advice on design thinking have an opportunity to
meet their “corporate twin.” This takes the form of analogous organizations, which
have the potential to assume a role model function. For educators, the website serves
as a source of inspiration for recharging their teaching materials, explanatory models
and perspectives on current problems in design thinking practice. Please get in touch
with us to share your experiences and stories via thisisdesignthinking@hpi.de.
   We invite and encourage you to engage in dialogue with us on your ideas,
questions, experiences and insights. May this publication serve you as a deep-
dive into design thinking tools, methods and metrics. We hope you enjoy the read
and that this book becomes a source of inspiration for your own work.
                                  Part I
Tools and Techniques for Improved Team
                             Interaction
Globalized Design Thinking: Bridging
the Gap Between Analog and Digital
for Browser-Based Remote Collaboration
1 Introduction
Team-based working modes are an integral part of Design Thinking. Those teams,
especially in corporate environments, are increasingly distributed between loca-
tions over the globe. The question of how this remote collaboration can be accom-
plished by Design Thinking teams has been our research focus in recent years
(Gumienny et al. 2012a, b; Gericke et al. 2012a). A result of this research was the
design and implementation of the remote collaboration tool Tele-Board.
    During collaboration, the digital tool is not only a plain functional instrument. In
fact, with its usability and acceptance it is a relevant factor for the teams’ success.
Thus, making the tool an important player among the members of a team requires
its interplay with the team’s working situation.
    There are two aspects that we focused on during the usage-driven iterative
development of Tele-Board and its support for distributed Design Thinking work-
ing modes.
    Mobile devices such as smartphones and tablets play an increasingly important
role in today’s working environments. In order to participate in remote collabora-
tion processes supported by Tele-Board, our system must be fully accessible by
those devices. Current technologies utilized by our system make it difficult to use
Tele-Board in mobile scenarios.
    A second point affects the decision of using Tele-Board during Design Thinking
activity in general. To date, this decision has to be taken right in the beginning of
the collaborative process. Starting with analog tools in an early phase of the process
and later changing to a digital mode, which is supported by Tele-Board, is difficult.
So far, there was no efficient way of transferring real world artifacts, such as sticky
notes or handwriting on a whiteboard to the digital world of Tele-Board. Actually,
this resulted in an all-or-nothing decision of whether to use our tool or not.
Switching from analog to digital was a manual process and therefore a tedious
undertaking.
    In this chapter, we present solutions for broadening the system’s availability on
mobile devices as well as increasing its flexibility of usage within the collaboration
process.
    We describe a web-based re-implementation of the Tele-Board whiteboard
component. This application is now completely executable within web browsers
without additional software requirements. Browsers are nowadays available on a
broad range of devices from traditional desktop computers to mobile devices such
as phones. This extends the availability of Tele-Board to a multitude of different
platforms. Furthermore, we present our newly developed software tool for auto-
matically digitizing analog whiteboard artifacts that can then be seamlessly used in
Tele-Board. This allows using originally analog content in distributed settings
as well.
   The existing system consists of different components (see Fig. 1), each espe-
cially designed for fulfilling a specific task:
• Whiteboard Client: Serves as the main user interface by presenting the digital
  whiteboard drawing surface. It enables people to do the work they are used to on
  traditional whiteboards by using common metaphors such as sticky notes, hand
  written sketches, or any other kind of content stuck to the board. Often, a
  whiteboard client will be started on a large digital whiteboard device, but can
  also be run on any Java-enabled computer.
• A communication server: Every whiteboard client connects to this server and
  thereby subscribes to updates of a specific panel. Content is organized in projects
  and panels and a project can consist of multiple panels. A panel is a whiteboard
  session, where people are working, and this includes the whole history of their
  interaction. Changes made on one location are automatically synchronized to all
  other locations in real time. So people are seeing instantly what people on the
  other side are doing. For example, while dragging a sticky note, the remote side
  can follow it moving over the board.
• Tele-Board history: This is a component embedded into the server. Every bit of
  synchronization communication is captured and immediately stored in a central
  database. This allows keeping track of the whole interaction process and not only
  snapshots. Every single operation is stored, which allows applications to resume
  from any point in time during the past lifetime of the panel, to copy sessions, and
  run statistics on the usage.
• Mobile devices: Although content can be created directly in the whiteboard
  client, for an efficient workflow that includes (digital) sticky note pads, we
  supply people with a variety of applications for mobile devices. Therefore, we
  developed native applications for different platforms, taking full advantage of all
  available mobile hardware. By using the apps for iOS or Android devices,
18                                                                     M. Wenzel et al.
      collaborators are able to take pictures, make drawings or just input text. The
      content is sent to a panel and can be arranged on the whiteboard surface.
1
    http://www.adobe.com/software/flash/about/
2
    http://www.microsoft.com/silverlight/
Globalized Design Thinking: Bridging the Gap Between Analog and Digital for. . .   19
    Building on the results and recommendations from Gutwin et al. (2011) for
browser-based real-time groupware, we use bi-directional WebSocket3 protocol and
its corresponding API as networking technology.
    The rendering part is implemented using HTML5 Canvas.4 Canvas represents a
rectangular area where graphics can be drawn with the help of a JavaScript API in
the browser. Compared to other technologies available in modern web browsers,
such as Scalable Vector Graphics5 (SVG), the effort for efficient use of Canvas is
higher but results in better rendering performance in our implementation.
    In order to allow a traceable arranging of sticky notes on the whiteboard surface,
a smooth movement animation is required instead of just setting the final position of
the sticky notes after moving them. In this way, our implementation requires
processing a large number of messages sent to the server, while rendering elements
on the whiteboard. The synchronization of whiteboard content and the required
network communication is a suitable use case for HTML5 Web Workers.6 Web
workers provide a mechanism for parallel task execution within web browsers. This
way, networking can be run with minimal interference with the web browser’s user
interface rendering and user interaction.
    Relying on HTML5 standards, modern web browsers, such as Microsoft Internet
Explorer 11, Google Chrome and Mozilla Firefox are supported by our
implementation.
    Implementation details are described in Wenzel et al. (2013).
3
  https://tools.ietf.org/html/rfc6455
4
  http://www.w3.org/TR/2013/CR-2dcontext-20130806/
5
  http://www.w3.org/Graphics/SVG/
6
  http://www.w3.org/TR/workers
Globalized Design Thinking: Bridging the Gap Between Analog and Digital for. . .              21
Fig. 3 Extended Tele-Board system architecture. Client-side parts are fully runnable inside a web
browser
question of how to ease entry into the digital world of Tele-Board when there are
real world artifacts to build upon and these need to be transferred efficiently.
7
  Dell short-throw interactive projector. http://www.dell.com/ed/business/p/dell-s300wi/pd.
(November 30, 2013).
8
  eBeam. http://www.e-beam.com/. (November 30, 2013).
9
  SMART Board interactive whiteboards. http://smarttech.com/smartboard. (November 30, 2013).
22                                                                     M. Wenzel et al.
amazing features for collaboration when needed. At this time, the final tool should
not involve redoing the whiteboard in Tele-Board or settling for immutable photos.
The aim is to design a system that allows for a seamless transition from an analog
whiteboard to a Tele-Board panel. Achieving this would allow teams to utilize
regular whiteboards to experience benefits when work has already been started on a
traditional whiteboard:
• Because there were not enough digital whiteboard devices available
• Because using a regular whiteboard was more convenient
• Because it didn’t really matter and a regular whiteboard was the preferred choice
   Such a symbiosis could help
• with the simple use of Tele-Board as a documentation tool (including handwrit-
  ing recognition (Gericke et al. 2012b) and search)
• document the development of an idea by saving different states of a whiteboard
  via the history
• start an idea transfer, whereby other teams are able to modify rather than simply
  view contents
   While the idea is of digitizing analog whiteboard artifacts is not new and
something that has been tried with photo and video processing; all approaches
had major drawbacks, concentrating on single areas too small to stand a chance of
providing support for Design Thinking sessions. Image processing techniques came
a long way but with handwriting working out well they are often completely
oblivious to the other entities like sticky notes. Other systems are able to recognize
them with multiple camera setups, which can potentially solve the problem. This
solution incorporates the very same limitations that led to tackling the problem in
the first place.
   Our goal is a solution integrated into the Tele-Board landscape that allows a
Design Thinking session to be transformed into a digital Tele-Board panel (see
Fig. 5). This solution should allow for minimal to no loss and thus satisfy require-
ments of Design Thinkers, who expect a solution to be easy to use and self-
explanatory. A software realizing this set of tasks should:
Fig. 5 General idea: transfer manual notes and scribbles into Tele-Board
24                                                                     M. Wenzel et al.
This section will give an overview of the process used to achieve the analog/digital
transformation. Besides giving a rough idea it will specify this process in more
detail. With this process in mind, the following section will introduce the final
application “in action.” Via a system walkthrough the theoretical process will be
linked to the workflow as it is outlined for the user of the proposed solution.
    With Tele-Board being a digital tool, all interaction and input is digital as well.
Our application introduces adding whiteboard content that does not originate in the
digital world. This approach, allowing the transition of an analog whiteboard into
the Tele-Board landscape, means it will finally be possible to directly proceed with
a digital setup.
    The main work is focused on how we can enable classical whiteboard sessions
with global digital collaboration. As stated before, the question relies on a synthesis
of classical whiteboards with digital collaboration tools. While with fully digital
workspaces Tele-Board already achieves this goal, mixed workspaces, especially in
asynchronously working teams, are still dependent on the answer as to how might
we bridge the gap between analog and digital whiteboards (Fig. 6).
    Any acceptable solution has to be non-intrusive in nature. This means that
overhead for the Design Thinking teams must be kept to a minimum. Capturing
the most accurate information as well as a detailed timeline would be possible with
a live capturing system. However this would include a lot of technology that has to
work properly and be set up in the beginning. This effort would cause a barrier
hindering widespread use. The alternative is taking photos of whiteboards. This is
not time consuming especially with the common practice of project documentation
in mind. Therefore taking pictures at important junctures is already part of the
workflow and the process just has to tweak this step in order to get optimal results.
A major disadvantage of this kind of setup is a missing detailed timeline. The photo
represents merely the final thought product. Nevertheless, photo documentation is
acceptable for the following reasons:
Globalized Design Thinking: Bridging the Gap Between Analog and Digital for. . .     25
Fig. 6 System landscape and how it broadens the scope of regular Tele-Board setups
   The following will introduce the process used for the transition of a whiteboard
photo into a Tele-Board panel. Based on the rationale described in the previous
paragraph, it centers on whiteboard pictures.
   In essence, the photo of a whiteboard to digitize will be handed over to an
application which, by using image enhancement and image processing methods,
identifies the whiteboard, notes, drawings and any other content. After this point the
content is imported to the Tele-Board portal to be used as any other regular Tele-
Board panel.
   Naturally—as shown in Fig. 7—the algorithm consists of several steps necessary
to achieving the digitalization of a whiteboard. An aspect, not directly part of the
process, is the camera used for the whiteboard image. While there are no direct
requirements for a specific camera, the image quality has a direct effect on the
performance and reliability of the algorithms. Thereby, without requiring high-end
equipment, any chance of increasing image quality will likely be rewarded by more
detailed and complete end results.
Fig. 7 Depicting the steps taken, starting with a whiteboard photo, to achieve a digital version of
the same information content
Globalized Design Thinking: Bridging the Gap Between Analog and Digital for. . .           27
Description           Illustration
Start
The application
starts with a mini-
mal user interface
introducing the
process steps and
main stage promi-
nently displaying a
drop area. This is
the easiest way of
adding a document
to a non-full
screen application.
                                                                                   (continued)
28                                   M. Wenzel et al.
Description           Illustration
[start] The photo
chosen will be
displayed. The
right side morphed
into a new drop
zone. It can be
used to start work-
ing on a new photo
at any time. Addi-
tionally the other
process steps are
unlocked now and
the user can
proceed.
[preparation] As
the image shows
more than just the
whiteboard itself,
Detect Whiteboard
is used. The result
is presented in the
center with the
identified corners
and option for
manipulation on
the right.
[preparation]
Unsatisfactory
results can be
corrected. The
manual options
pane on the right
allows making the
detected corners
moveable by
mouse. This
mechanism pro-
vides an auto-
mated suggestion
suitable to most
situations com-
bined with the
power of
interference.
                                         (continued)
Globalized Design Thinking: Bridging the Gap Between Analog and Digital for. . .               29
Description             Illustration
[preparation] Once
the whiteboard is
marked it is time
to extract
it. Cropping is a
central step as it is
the last prepara-
tion step and pro-
duces the image
that all the other
algorithms
rely upon.
[entities] Next in
line is entity
detection or more
specifically sticky
note detection.
The image of the
previous step is
used to search for
any sign of sticky
notes. If so, the
contours will be
superimposed for
the user to check
the result.
Complex clusters will automatically be separated. This feature is less reliable than others and is
optional via a user interface switch. If the algorithm missed a certain sticky note color or
mistakenly identified one, the manual option pane provides tools to remedy both situations.
Setting changes will cause reprocessing.
                                                                                       (continued)
30                                                                               M. Wenzel et al.
Description           Illustration
[entities] Addi-
tionally, wrongly
identified sticky
notes can be
removed. A
marker will be
placed in its posi-
tion reminding the
user why this area
was not recog-
nized. This is a
helpful tool to
remove other enti-
ties, e.g. a docu-
ment as shown on
the right.
[export] The
export form allows
specifying all
options needed for
the final import in
the Tele-Board
system. User cre-
dentials and sys-
tem URL are
obvious choices.
As the whiteboard
becomes a Tele-
Board panel, some
information about
the new panel has
to be addressed.
The bottom pane allows the user to specify how the content should be scaled and displayed.
Internally it collects all the data, packages it and handles the communication with the Tele-Board
portal server.
                                                                                        (continued)
Globalized Design Thinking: Bridging the Gap Between Analog and Digital for. . .    31
Description             Illustration
[vectorization]
Content detection
is where the
whiteboard draw-
ings become vec-
tor information
that can later be
manipulated in the
whiteboard client,
e.g. be erased. It is
a silent component
that is called as
part of the export
process, hence no
user interface for
the user.
[tele-board]
Finally, the white-
board photo has
been transferred
into a Tele-Board
whiteboard and
can be used by the
whole team glob-
ally with the
whiteboard client.
If another white-
board needs to be
digitalized, the
user could start
from the beginning
using the drop side
panel.
5 Conclusion
settings was hardly possible. The decision to use our system had to be made in the
beginning of the working process.
   We integrated a new tool in Tele-Board’s system landscape, closing the media
gap existing as long as digital whiteboard tools have been around. It renders
assistance to all those who want a digital tool as universal and flexible as a
traditional whiteboard, yet offering all the benefits of the digital world. The
application starts with a whiteboard photo, a step all too familiar to Design
Thinking teams. By applying advanced image processing algorithms, a digital
clone of the former analog whiteboard is created, enabling classical whiteboard
sessions with global digital collaboration.
   A way was found to overcome obstacles in digital whiteboard use by removing
the bar for switching to digital whiteboards from a regular whiteboard. This
formerly optional technology now becomes yet another tool for Design Thinking,
readily accessible whenever needed.
   Arriving at the digital world, the software tool becomes an integral part of the
working process in a design team. As a team player, Tele-Board has to adapt, in
terms of supported hardware devices used for interacting with the system. Mobile
devices such as mobile phones and tablets are more and more important in today’s
working environments. The previous version of Tele-Board’s whiteboard client
component was based on Java technology, which is not available on mobile
platforms. Furthermore, starting this application became increasingly cumbersome.
Hence, we re-implemented this component on the basis of modern web technolo-
gies allowing its seamless usage from within web browsers on desktop as well as
mobile systems while avoiding Java technology constraints. This way, it allows a
more flexible participation in remote collaboration settings.
   By consequently following the web-based approach, future development will
focus on server side whiteboard detection. The goal is to integrate this component
as a web-based application making Tele-Board an integrated system running in a
browser and therefore accessible by the maximum number of available devices.
References
Anttonen M, Salminen A, Mikkonen T, Taivalsaari A (2011) Transforming the web into a real
   application platform: new technologies, emerging trends and missing pieces. In: Proceedings
   of the 2011 ACM symposium on applied computing—SAC ’11, p 800
Gericke L, Gumienny R, Meinel C (2012a) Tele-board: follow the traces of your design process
   history. In: Plattner H, Meinel C, Leifer L (eds) Design thinking research. Springer, Berlin, pp
   15–29
Gericke L, Wenzel M, Gumienny R, Willems C, Meinel C (2012b) Handwriting recognition for a
   digital whiteboard collaboration platform. In: Proceedings of the 13th international conference
   on Collaboration Technologies and Systems (CTS 2012)
Gumienny R, Meinel C, Gericke L, Quasthoff M, LoBue P, Willems C (2011) Tele-board:
   enabling efficient collaboration in digital design spaces across time and distance. In:
   Plattner H, Meinel C, Leifer L (eds) Design thinking, understand—improve—apply. Springer,
   Berlin, pp 147–164
Globalized Design Thinking: Bridging the Gap Between Analog and Digital for. . .              33
1 Introduction
1
    User empathy and perspective-taking.
2
    Ideation, prototyping, and testing.
N. Sonalkar (*) • A. Mabogunje • G. Pai • A. Krishnan
Center for Design Research, Stanford University, Building 560, 424 Panama Mall, Stanford,
CA 94305, USA
e-mail: sonalkar@stanford.edu
B. Roth
Hasso Plattner Institute of Design, Stanford University, Stanford, CA, USA
2 Research Questions
3 Method
Fig. 1 A conversation between three designers A, B and C is visualized using the Interaction
Dynamics Notation
  Table 1 gives a detailed explanation of each symbol used in the visual notation.
  For further information about the development of the Interaction Dynamics
Notation, please refer to Sonalkar et al. (2013).
The initial research on the Interaction Dynamics Notation depended on paper and
pen based manual analysis of video. This was a tedious and time-consuming
process. In order to use the Interaction Dynamics Notation in a visual diagnostic
instrument viable for real-world use, it was necessary to develop a software tool that
would accelerate the use of the notation. This software tool would then form the
core element of a visual diagnostic system for design thinking teams. The following
design requirements were identified for developing this software tool, which later
became known as the IDN Tool.
1. The IDN Tool must work with video data as well as conversations happening in
   real-time.
2. The IDN Tool must give a visual output in terms of the Interaction Dynamics
   Notation that is easy to understand.
3. The IDN Tool should include sequential analysis methods in order to accelerate
   the analysis of team interaction patterns.
4. The IDN Tool must work on both Windows and OSX platforms.
Diagnostics for Design Thinking Teams                                                           39
Table 1 The visual symbol set for the Interaction Dynamics Notation
Symbols      Name           Description                         Example
             Move           A ‘move’ indicates that a           A: I need to buy Legos
    A                       speaker has made an expression      (at) home. Think about how
                            that moves the interaction for-     therapeutic it would be.
                            ward in a given direction.
    ?        Question       A question indicates an expres-
                            sion that elicits a move. A ques-
                                                                A: Where should we start?
Table 1 (continued)
Symbols      Name           Description                          Example
             Interruption   An interruption is indicative of a   B: Should we start generating
                            speaker being interrupted by         some concepts now?
                            another speaker or at times by       A: Yeah (interrupted by X).
X                           himself.                             X: 10 min are gone.
             Yes and        A move is considered to be a         A: What about. . . if we made a
                            ‘Yes and’ to the previous move       toy that incorporates girls and
                            if it accepts the content of the     boys. Its like a house that has a
                            previous move and adds on to it.     car with it kind of like enables
                                                                 the guys to play with the girls?
                                                                 C: I think that’s a good point to
                                                                 have some sort of a educational
                                                                 point in it.
             Deviation      Deviation indicates a move that      C: But we need to remember it.
      C                     changes the direction of the         C: This is not the buildable
                            conversation from the one            room (deviating from previous
                            implied by the previous moves.       topic).
             Humor          Humor indicates instances of         A: I don’t know I probably
                            shared laughter in teams.            would have swallowed but (All
     A,B                                                         of them laugh).
The IDN Tool software enables a user, an IDN analyst or researcher to import video
data, code the video data so that specific visual symbols are assigned to specific
speaker responses, and then output this assignment of visual symbols as a visual
representation. The IDN Tool also incorporates a sequential analysis functionality
that includes finding patterns of sequential symbol assignments, and conducting a
Markov analysis of the probability of one response following another. Figure 2
describes the basic functionality of the IDN Tool.
Diagnostics for Design Thinking Teams                                                41
   This functionality is achieved through the various functional modules that are
part of the IDN Tool as described in Fig. 3, and the file structure described in Fig. 4.
   The IDN Tool is coded in Python language on a Linux platform. The decision of
using Python on Linux was made in order to create a software code that could then
be easily ported to Windows or mac OSX platforms.
42                                                                    N. Sonalkar et al.
Fig. 4 The corresponding files architecture that forms the IDN code
Figure 5 describes the various screens that are part of the IDN Tool user interface.
   The main video coding screen is shown in Fig. 6. The video is displayed
prominently in the middle. The lower display bar shows the symbols that are
being coded. The right-hand panel shows the speaker label assignments, and the
hot key assignments for each symbol. It also includes a toggle button for coding
start and stop of topic segments.
Diagnostics for Design Thinking Teams                                           43
Fig. 6 The interface elements of the video coding screen of the IDN Tool
   Once the coding is completed, the file can be saved and the visual output
representation and the alphanumeric representation can be exported from the
main menu bar.
   In order to accelerate the coding of video, the IDN Tool can be operated through
a set of hot keys. The keyboard is modified by overlaying the IDN symbols on the
hot key buttons as shown in Fig. 7.
44                                                                     N. Sonalkar et al.
Two team interaction analysis studies were conducted using the IDN Tool. The first
was a comparative study of the IDN Tool with manual method of coding the
Interaction Dynamics Notation. Four concept generation team sessions each of
40 min were analyzed using the IDN Tool as well as by using paper and pen. The
comparison showed that the IDN Tool was 33 % more effective for researchers to
code video data as compared to paper-based coding.
   The second study was designed to test the version 3 of the IDN Tool with a data
set that had measurable design outcome parameters. This study is described in
detail in this section.
Seven teams of three to four participants were given a concept generation task
based on a real-world challenge. The participants were chosen from graduate
students at Stanford University who had previous exposure to design thinking.
The teams were invited in to the Design Observatory (Carrizosa et al. 2002; T€orlind
et al. 2009), which is pre-configured to record multiple video streams for design
activity analysis. The teams were given the design brief that asked them to generate
two concepts for the challenge of lifting water from below 50 ft. underground for
small holding farmers in Myanmar. The design brief was designed to stimulate
conversation and hence it explicitly asked the teams to generate a best-fit concept
that would be technically feasible, and a wild idea or a dark horse concept that was
unlikely to be feasible, but could revolutionize irrigation if it could work. The teams
were given 60 min for concept generation and a further 20 min to sketch their
deliverables and submit them to the research team. Figure 8 shows the team
interaction setting and Fig. 9 shows a sample concept sketch arising from such
team interaction.
   The videos of the seven design teams were imported into the IDN Tool and
analyzed to create visual representations of their concept generation interaction.
The occurrence of ideas during interaction was depicted by highlighting the sym-
bols corresponding to the speaker turns in which ideas were expressed. Figure 10
shows a sample visual representation output from the IDN Tool.
   The use of IDN Tool for analyzing concept generation interactions of design
teams demonstrated its use as a video coding tool. The IDN Tool generated a visual
output, as well as an alphanumeric output of concept generation interactions for
seven teams.
   The IDN Tool used the alphanumeric output to detect sequences of symbols that
occurred more than three times in the concept generation session. Some examples
Diagnostics for Design Thinking Teams                                        45
Fig. 9 A concept sketch developed by one of the design teams in the study
of sequences identified in Team 1 are mmqm (12 times), mshm (14 times), and
mhms (eight times) where m ¼ move, q ¼ question, h ¼ silence, and s ¼ support.
The IDN Tool was able to identify 33 (Team 7) to 142 (Team 2) such sequences in
the concept generation data for each of the teams. The identification of such
interaction patterns addressed the first research question presented in Sect. 2.
46                                                                         N. Sonalkar et al.
Fig. 10 A sample visual output of team concept generation interactions from the IDN Tool
However, even though these interaction patterns were identified, further analysis
was required to address the second research question, and understand which of
these patterns correlated with design outcomes.
Table 2 Concept scores and team score derived from expert ratings
                                    Concept           Concept           Team      Team
                                    average utility   average novelty   utility   novelty
Team      Concepts                  score             score             score     score
Team 1    Tree mechanism pump       2.95              3.2               2.6       3.2
          (dark horse)
          Pressurized u-tube        2.6               3.55
          (best fit)
Team 2    Continuous sponge         2.15              3.05              3.65      3.05
          tube (dark horse)
          Open mine well (best      3.65              1
          fit)
Team 3    Capillary pump (dark      2.25              3.1               2         3.1
          horse)
          Continuous sponge         2                 3.05
          tube (best fit)
Team 4    Continuous belt of fab-   2.15              3.05              2.1       3.05
          ric (dark horse)
          Bamboo deep lift pump     2.1               1.9
          (best fit)
Team 5    Handkerchief wringer      2.55              2.05              3.7       2.05
          (dark horse)
          Two sequential pumps      3.7               1
          (best fit)
Team 6    Handpump                  3.1               1.35              1.35      1.35
          + Archimedes screw
          (dark horse)
          Solar powered propel-     1.35              2.65
          ler pump (best fit)
Team 7    Sponge chain (dark        2.45              3.4               1.75      3.4
          horse)
          Opposing piston pump      1.75              2.5
          (best fit)
The key interaction patterns identified through analysis of the visual output were
then compared with the novelty and utility ratings for the seven teams. The
comparison resulted in the following findings regarding the relationship between
interaction patterns and outcome measures.
1. Episodes of concept elaboration had a strong positive correlation (r ¼ 0.71) with
   utility. The greater the number of episodes of concept elaboration in a team, the
   greater the utility rating of the concepts generated by the team. Figure 11 gives
   an example of an episode of concept elaboration identified in the visual output of
   the IDN Tool.
48                                                                                N. Sonalkar et al.
Fig. 11 This figure highlights an episode of concept elaboration. Concept elaboration episode
consists of at least three consecutive idea expressions indicating that participants are contributing
ideas to develop a particular solution concept
2. Dialectic episodes had a very strong positive correlation (r ¼ 0.9) with utility.
   The greater the number of dialectic episodes in a team, the greater the utility
   rating of the concepts generated by the team. Figure 12 gives an example of a
   dialectic episode identified in the visual output of the IDN Tool.
3. Occurrence of humor had a strong positive correlation (r ¼ 0.55) with utility.
   The greater the number of humor occurrences in a team, the greater the utility
   rating of the concepts generated by the team.
4. Occurrence of yes-and responses had a strong negative correlation (r ¼ 0.83)
   with novelty. The greater the number of yes-and occurrences in a team, the lesser
   the novelty rating of the concepts generated by the team.
Diagnostics for Design Thinking Teams                                                      49
Fig. 12 This figure highlights a dialectic episode found within the visual output from the IDN
Tool. A dialectic episode consists of more than two consecutive block and overcoming responses
indicating that participants are engaged in an argumentative dialectic
9 Limitations
A key limitation of the study is the small number of teams used to test the IDN Tool.
Seven is a small number to obtain results that are statistically generalizable. Still,
this preliminary study shows that it is possible to correlate the patterns identified
through IDN Tool with design outcomes. Thus, the study can be considered
significant not for its results, but for the process of establishing a protocol for
testing a diagnostic instrument for design thinking teams: design activity—expert
assessment of activity outcomes—pattern detection.
10 Discussion
Developing a visual diagnostic instrument for design thinking teams has implica-
tions for design thinking research, education and practice. A key element of a visual
diagnostic instrument for team interaction is the reference database that indicates
whether the pattern detected has any significant meaning in relation to the desired
design outcome. Preparing such a database is itself a valuable research activity that
could help develop a scientific foundation for our understanding of design thinking
teamwork. In terms of instrumentation engineering, preparing such a database
would correspond to calibrating the instrument. A key aspect of calibration with
regards to design thinking is the identification and categorization of the context of
the team interaction that is being analyzed. Since design is a context dependent
activity, we believe that capturing the context in which team interactions occur is
important in order to understand the limits of generalizability of patterns-outcome
relationships. In the study of team interactions presented in this chapter, context
parameters include the nature of the design activity—concept generation, the
familiarity of the participants with each other—the team members were not familiar
with each other before the study, and the familiarity of team members with the
domain of the design brief—most participants were not familiar with the physics of
water flow and pumping that formed the design challenge. We propose to repeat the
study using the IDN Tool with a greater number of teams with varying context
parameters in order to develop a robust calibration of relationships between inter-
action patterns and design outcomes mediated by the context of the design activity.
    The key implication of a visual diagnostic instrument for practice and education
is the capability to inform in-process feedback. Teams could be given behavioral
feedback based on the visual diagnostic instrument that could enable them to
improve their design performance. This feedback could be given either directly to
teams or through coaches. The visual diagnostic instrument could become a
coaching aid to inform coaches about on-going interaction patterns that could be
conducive or detrimental to design outcomes, so that the coaches can then intervene
appropriately. Thus, the development of a visual diagnostic instrument such as the
IDN Tool discussed above has the potential to augment design team performance in
education and practice, while being grounded in rigorous design thinking research.
Diagnostics for Design Thinking Teams                                                        51
11 Conclusion
In this chapter, we presented the development of a software tool called the IDN
Tool that can help identify patterns of team interaction which are positively or
negatively correlated with design outcomes. The IDN Tool was used to analyze
concept generation interactions of seven teams. As we continue using the IDN Tool
to analyze larger amounts of data, we expect to gradually build a database of team
interaction patterns and their correlation with design outcomes. Such a database
would form the reference database that would enable the IDN Tool to function as a
diagnostic instrument for analysis of design thinking teams in industry and acade-
mia. The research presented in this chapter is the first step in that direction.
References
Carrizosa K, Eris Ö, Milne A, Mabogunje A (2002) Building the design observatory: a core
   instrument for design research. In: Proceedings of DESIGN 2002, the 7th international design
   conference, Dubrovnik
Cross N, Dorst K, Christiaans H (eds) (1996) Analysing design activity. Wiley, New York, NY
Edelman J (2011) Understanding radical breaks: media and behavior in small teams engaged in
   redesign scenarios. Ph.D. thesis, Stanford University
Jung M (2011) Engineering team performance and emotion: affective interaction dynamics as
   indicators of design team performance. Ph.D. thesis, Stanford University
Kress G (2012) The effects of team member intrinsic differences on emergent team dynamics and
   long-term innovative performance in engineering design teams. Ph.D. thesis, Stanford
   University
Pfeffer J, Sutton RI (2013) The knowing-doing gap: how smart companies turn knowledge into
   action. Harvard Business Press, Boston, MA
Schar M (2011) Pivot thinking and the differential sharing of information within new product
   development teams. Ph.D. thesis, Stanford University
Shah JJ, Smith SM, Vargas-Hernandez N (2003) Metrics for measuring ideation effectiveness. Des
   Stud 24(2):111–134
Sonalkar N (2012) A visual representation to characterize moment-to-moment concept generation
   through interpersonal interactions in engineering design teams. Ph.D. thesis, Stanford
   University
Sonalkar N, Mabogunje A, Leifer L (2013) Developing a visual representation to characterize
   moment-to-moment concept generation in design teams. Int J Des Creat Innov 1(2):93–108
Tang J, Leifer L (1991) An observational methodology for studying group design activity. Res Eng
   Des 2(4):209–219
T€
 orlind P, Sonalkar N, Bergstr€
                              om M, Blanco E, Hicks B, McAlpine H (2009) Lessons learned and
   future challenges for design observatory research. In: Proceedings of ICED 09, the 17th
   international conference on engineering design, vol 2, Design theory and research methodol-
   ogy, Palo Alto, CA, USA
Valkenburg R (2000) The reflective practice in product design teams. Delft University of Tech-
   nology, Delft
Design Thinking Health: Telepresence
for Remote Teams with Mobile Augmented
Reality
Abbreviations
AR          Augmented reality
CMS         Centers for medicare and medicaid services
DSLR        Digital single-lens reflex
EMR         Electronic medical record
HMD         Head-mounted display
HAPUs       Hospital-acquired pressure ulcers
ICU         Intensive care unit
IMU         Internal measurement unit
1 Introduction
In the healthcare field, the need for improved tools to enhance collaboration among
patients and providers has become increasingly urgent—due, in large part, to a
global rise in aging populations and chronic disease prevalence, coupled with
increasing health care costs and physician shortages worldwide (Mattke
et al. 2010; Magnusson et al. 2004). To address these challenges, this research
examines the use of wearable mobile computing to mediate interdisciplinary
communication and collaboration in healthcare. We present an overview of collab-
orative technologies and interaction modalities for home-healthcare and hospital
use. Advances in mobile computing include technologies to improve remote patient
monitoring, automate simple knowledge-base procedures, and facilitate the deliv-
ery of interventions.
For home healthcare applications, existing systems enable medical staff to remotely
monitor patients suffering from advanced chronic disease and provide prompt
support regarding health education and treatment compliance. The Vsee video
collaboration system (Sunnyvale, CA), for instance, simplifies patient-doctor inter-
actions through web-based video calling, coupled with medical device integration.
    In the consumer health and wellbeing space, a variety of products provide self-
patient monitoring to encourage behavior modifications aimed at promoting health-
ier lifestyles. These include activity monitors such as the Nike Fuelband and Fitbit
Flex, as well as the LUMOback real-time posture feedback system. Yet, despite
current advances, home monitoring technologies are often limited by the depen-
dency on appropriate bandwidth, customized networks and high-cost equipment, as
well as a lack of integration into electronic medical record (EMR) systems.
    In the hospital setting, remote monitoring systems, such as the tele-ICU, aid
clinicians in the delivery of care to ICU patients (Fig. 1). By collaborating with the
bedside team, the tele-ICU silently assists in the delivery of timely interventions
(Goran 2010). Yet, financial barriers associated with installing and operating such
systems has limited widespread adoption.
Design Thinking Health: Telepresence for Remote Teams with Mobile Augmented. . .   55
For patient-virtual agent interaction, the animated virtual nurse (VN) is an auto-
mated system that teaches patients their post-discharge self-care regimen directly
from their hospital beds (Bickmore et al. 2009). This system incorporates a VN who
embodies best practices in health communication for patients with inadequate
health literacy, and illustrates a growing field in mobile computing aimed at
increasing universal healthcare access (Fig. 2).
   Cognitive aids, such as dynamic checklists, present another example of tools to
facilitate collaboration in the clinical setting, through automating knowledge-based
procedures. A recent study involving dynamic checklists found that medical crisis
care situations reveal “a physically complex information space, and relatively high-
tempo time scales of gaze, action, and team-based coordination and communica-
tion” (Wu et al. 2011).
To narrow our research focus, we began with needs finding at Stanford Health Care
from October to December 2013. We interviewed ten nurses and two physicians,
and shadowed four additional nurses (n ¼ 16 clinicians), to generate over
135 needs. We grouped needs into 15 broad clinical areas and ranked each category
on a 5-point scale based on degree of importance to the hospital (pain point),
alignment with research interests, and feasibility (Aldaz et al. 2015) (Table 1).
Design Thinking Health: Telepresence for Remote Teams with Mobile Augmented. . .               57
Table 1 Clinical needs finding, top needs based on a 5-point scale (Aldaz et al. 2015)
                                         Degree of           Alignment
                                         importance to the   with
                    Target application   hospital (Pain      research                    Mean
Clinical area       areas                point)              interests     Feasibility   score
Documentation       Wound care           4                   4             5             4.3
(Video record-      assessment
ings/
Photographs)
Documentation       Wound care           5                   3             4             4.0
(EMR Data)          assessment
Alerts and alert    Surgical monitor-    5                   3             4             4.0
fatigue             ing and Response
                    to cardiac arrest
Improving live      Point-of-view        3                   3             5             3.7
visualization of    sharing during
cases               surgery
Communication       Multiple areas       5                   2             4             3.7
Chronic wounds affect 6.5 million Americans and pose a $25 billion annual
financial burden to the U.S. health care system (Sen et al. 2009). Of particular
importance to hospitals are pressure ulcers (bedsores), which along with venous and
diabetic ulcers, comprise the vast majority of chronic wounds. A pressure ulcer is
“localized injury to the skin and/or underlying tissue usually over a bony promi-
nence, as a result of pressure, or pressure in combination with shear. A number of
contributing or confounding factors are also associated with pressure ulcers; the
significance of these factors is yet to be elucidated” (NPUAP 2007). Studies have
reported that a pressure ulcer extends the length of stay for an acute hospital
58                                                                  L.A. Shluzas et al.
admission by 7–50 days (Graves et al. 2005). Nearly 60,000 hospital patients in the
U.S. die each year from complications due to hospital-acquired pressure ulcers
(HAPUs), while the direct treatment costs for HAPUs at an average hospital range
between $400,000 and $700,000 (Gonzales and Pickett 2011). In 2007, the Centers
for Medicare and Medicaid Services (CMS) made a policy decision not to pay for
the most severe HAPUs (Lyder et al. 2012). Thus, to reduce the incidence of
HAPUs, hospitals are mandating thorough and accurate documentation, which is
necessary to determine the degree of wound severity, evaluate the effectiveness of
therapies, and modify treatment plans as appropriate.
   Increasingly, healthcare facilities are supplementing direct observation with
digital photography for assessment and documentation (Ahn and Salcido 2008).
The National Pressure Ulcer Advisory Panel (NPUAP) strongly encourages that
every wound photograph include patient identification, date and time markings, and
wound dimensions (NPUAC 2014). In practice, nurses write each patient’s personal
identification (ID) information—e.g. name and medical record number (MRN)—on
a paper ruler that they hold next to the wound and include in the picture frame
(Fig. 3).
   Nurses commonly use digital cameras for wound photography, but the process is
often tedious, time consuming, and not private. At Stanford Healthcare, uploading
wound images to a patient’s Electronic Medical Record (EMR) typically requires
up to 2 days. Recent advances in mobile technology have significantly reduced the
time between taking wound photographs and uploading images to a patient’s EMR.
Nonetheless, smartphones and tablets still have a significant drawback for wound
photography: they are hands-on devices. Consequently, two to three nurses are
required to capture wound images—one to hold a camera, a second to hold a ruler
and position the patient, and often a third nurse to assist with patient positioning.
Furthermore, the physical handling of cameras increases the likelihood of cross-
contamination and patient infections (Aldaz et al. 2015).
The system under development builds on prior work in mobile devices for wound
photography. The types of devices capable of non-contact wound photography
include general-purpose devices (e.g., digital cameras, smartphones, and tablets),
wearable (e.g., optical head-mounted displays), and special-purpose devices.
General-Purpose Devices A common way for nurses to take wound photographs
is via dedicated digital cameras (Hayes and Dodds 2003). After taking photos, a
nurse transfers images to a secure, password-protected computer, and deletes the
images from the camera. Since digital cameras are not networked, a nurse would
then transfer the images via a physical memory card. At Stanford Healthcare, nurses
insert a memory card in a printer to make physical prints. He or she then places each
print on a separate piece of paper and fills in relevant information with a pen. Each
day, couriers transport a stack of these documents offsite for scanning. It requires up
to 2 days for the images to be uploaded to a patent’s EMR.
   Epic Systems (Verona, WI) is a leading provider of EMR software (Epic
Systems 2014). The Epic Haiku mobile application allows nurses to take photo-
graphs using a general-purpose smartphone and immediately upload images to a
patient’s EMR, where other clinicians may view the images (Beard and Hamid
2014). Photos taken within the Epic Haiku app are not stored on the phone’s
memory, thereby ensuring compliance with security and privacy protocols (Beard
and Hamid 2014). To use Epic Haiku, a nurse typically holds the smartphone in
both hands while another nurse holds the paper ruler and positions the patient.
Alternatively, one nurse can hold the paper ruler in one hand and the smartphone in
the other, in which case pressing the button to take a photograph is often challeng-
ing (Fig. 4). After taking each image, the nurse must manually enter a text
description, which requires one’s full attention and creates a barrier between the
nurse and patient. After the patient encounter, the nurse meticulously wipes down
all smartphone surfaces to disinfect the device.
   MOWA (Mobile Wound Analyzer) is an application for Android smartphones
and tablets that takes photos and uses software-based technologies to calculate
wound area and identify three types of tissues in the bed of the lesion (Mobile
Wound Analyzer 2014). Presently, however, MOWA does not integrate with
EMRs, thus limiting its utility.
Wearable and Special-Purpose Devices In addition to hand-held digital cameras,
a range of studies has demonstrated the use of head-mounted cameras (e.g. GoPro
Hero3) for clinical use and education (Bizzotto et al. 2013). A major advantage of
wearable head-mounted cameras, “is that the user is not restricted in his or her
manual and oculomotor actions by camera operation tasks” (Wagner et al. 2006).
This allows the performance of actions in a natural way with increased mobility.
Although often WiFi-enabled for remote viewing, wearable cameras typically lack
patient identification and EMR integration capabilities for routine clinical care.
   Special-purpose devices usually include sophisticated wound-measurement
methods, in addition to photographic capabilities. These devices achieve increased
accuracy through vision-based technologies such as stereophotography or struc-
tured lighting. For example, MAVIS III (Perry Baromedical, Riviera Beach, FL) is
a stereoscopic, hand-held camera that measures the area, volume, and circumfer-
ence of chronic wounds (MAVIS III 2014). Traditionally, special-purpose devices
have suffered from one or more of the following drawbacks: they may be expen-
sive, cumbersome to use in a clinical setting, or require significant training time
(Krouskop et al. 2002).
Beyond mobile devices for wound care photography, augmented reality “mediates
ideas between humans and computers, humans and humans, and computers and
humans” (Craig 2013). Such systems can supplement the real world with virtual
objects that appear to coexist in the same space (Zhou et al. 2008), and have the
potential to provide contextual, situated learning experiences for users (Zhu
et al. 2014). With a wearable head-mounted display, the computer can unobtru-
sively display messages to gain a user’s attention, as well as display text or graphics
in one’s visual field (Starner et al. 1997).
    Wearable augmented reality (AR) systems have been demonstrated in a range of
clinical application areas, such as image-guided surgery, pre-operative imaging
training, health-related behavior change, and clinical education (Craig 2013; Dixon
et al. 2013; Doherty et al. 2013). In particular, a range of studies in the clinical field
has demonstrated the use of wearable technology with cameras and optical head-
mounted displays. Bizzotto et al. (2013), for instance, used the GoPro HERO3 in
percutaneous, minimally invasive, and open surgeries with high image quality and
Design Thinking Health: Telepresence for Remote Teams with Mobile Augmented. . .    61
resolution. Likewise, though not widely distributed for consumer use (as of January
2015), a number of proof-of-concept projects and studies using the Google Glass™
optical head-mounted display (Google Glass 2013) have been demonstrated in the
medical field. The Glass technology can reflect projected images while allowing the
user to see through the device’s eyepiece. Medical applications for Glass have
included remote collaboration between doctors, heads-up viewing of vital signs,
medications, and lab reports, as well as live streaming of operations to medical
students (Glauser 2013). Several reports document the live two-way broadcasts of
actual patient surgeries by doctors wearing Glass in the operating room (Lutz and
Kwan 2014; Medical Xpress 2013).
    Although studies have examined the use of wearable AR systems for both
in-patient and remote medical procedures (Lutz and Kwan 2014; Medical Xpress
2013; Dixon et al. 2013), there is a growing need for the development of integrated
systems that enable users to seamlessly capture and annotate visual images in a
hands-free manner, and integrate this information with a patient’s EMR. Outside of
wound care, a specific application in which hands free image capture is particularly
critical includes point-of-view (POV) data sharing during surgery.
    The development of a software application for POV sharing during surgery
requires three key elements: voice commands, bi-directional communication, and
EMR data transfer. The use of voice commands to control digital image capture in a
hands-free manner is particularly useful for clinicians to maintain heightened
sterility while performing surgical procedures. Bi-directional communication is
required to enable collaboration between surgeons wearing a head-mounted display
and remote colleagues, as well as to establish connectivity with remote sensors in
the healthcare environment.
    In addition to software development, the physical augmentation of head-
mounted displays is needed to bring wearable technology and wearable computing
to the operating room environment. Physical enhancements include, for instance, a
transparent splash shield that surgeons may adhere to the front frame of a head-
mounted display for protection from infectious disease, an optical loupe mounted to
the frame (in front of a surgeon’s eye) in order to increase magnification for surgical
procedures, and elements encased on an HMD in a protective cover for improved
cleaning and robustness during routine clinical use.
To address the needs for improved hands-free visualization, image capture, and
EMR transfer, we are currently developing a Google Glass application (known as
“Glassware”) and an Android smartphone application. This application was ini-
tially designed to be used by nurses for chronic wound photography (Shluzas
et al. 2014a, b; Aldaz et al. 2015), but is now extending into surgery and other
clinical areas. Google Glass was selected as an initial platform for research, since it
has an optical head-mounted display that is capable of taking pictures and recording
62                                                                            L.A. Shluzas et al.
videos using an integrated camera. The device also has the ability to communicate
wirelessly via WiFi and Bluetooth. Additional sensors integrated into the device
enable the development of augmented reality-based applications. We developed
Glassware applications using the Glass Development Kit (GDK) for Android 4.0.4
(API Level 15, Ice Cream Sandwich), and the smartphone app for Android 4.3 (API
Level 18, Jelly Bean) (Shluzas et al. 2014a, b).
    The novel features of the system include (i) barcode scanning using the Glass
camera, and tagging subsequent images with a patient’s personal identification
information that is embedded in the barcode, (ii) capturing a live video preview
in the Glass eyepiece before a photo is taken, (iii) using a double-blinking gesture to
take photographs, utilizing Glass’ IR sensor, and (iv) using a head-tilt gesture
(while in the preview mode) to zoom in and out of an image, and a head-tilt gesture
to send images to a patient’s EMR, through the use of Glass’ internal measurement
unit (IMU) sensor (Aldaz et al. 2015).
Google Glass has four sensors that we have used to implement the system for
hands-free wound photography: a camera, an inertial measurement unit (IMU), an
infrared sensor facing toward the user’s right eye, and a microphone (Fig. 5).
Camera with Digital Zoom Albrecht et al. (2014) concluded that Glass was
efficient for acquiring images, but for photo documentation in forensic medicine,
the image quality was inferior to those from a Digital Single-Lens Reflex (DSLR)
camera. This finding is perhaps not surprising, considering that Google Glass uses a
standard 5-megapixel (2528  1956) smartphone camera with a fixed focal length
of 3 mm. The Glass camera has a wide-angle lens that captures a field of view too
Fig. 5 System leveraging the camera and internal sensors of the Glass™ wearable computing
device to create a mobile hands-free application for capturing, tagging, and transferring digital
images to a patient’s medical record (Aldaz et al. 2015)
Design Thinking Health: Telepresence for Remote Teams with Mobile Augmented. . .   63
large for most typical medical applications and illustrates the need for zoom
(Muensterer et al. 2014).
   Because Glass has a fixed focus, all zooming is digital and does not lead to a gain
in optical resolution. In theory, a photo could be cropped in post-processing to
obtain an identical image as one taken after digital zooming, but our preliminary
research showed that nurses appreciated having the ability to zoom before taking
the picture. In practice, we discovered one drawback with zooming before picture
taking. Usually, Glass uses burst-mode photography (capturing a rapid sequence of
shots to improve dynamic range or reduce noise) when instructed to take a picture
(Google + Glass Update 2014). Burst-mode is lost when taking a picture after a
zoom, resulting in lower quality photos.
Barcode Reader Hospital patients wear ID wristbands printed with a medical
record number and corresponding barcode. We implemented a barcode reader
based on the open source, cross-platform library ZBar (2014). ZBar works very
well with newer smartphones, like the iPhone 5, that have auto-focusing cameras.
Since the Google Glass built-in camera has a fixed focus, we used trial and error to
find a barcode size that ZBar could read at arm’s length, where the image was
focused (we found that objects closer than approximately 30 cm would not be in
focus). Furthermore, we had to use a flat barcode, rather than the curved barcode on
wristbands used at most hospitals.
Microphone The Android 4.0.4 GDK version allows developers to designate a
custom voice command to launch an application. The Glassware digital image
capture applications we developed is launched with the command, “ok Glass,
take clinical image.” We also used the microphone for wound annotation purposes,
whereby our app sent audio to the Google servers for speech-to-text conversion.
Previous investigators were surprised to find that Glass recognized complex med-
ical terms such as “Microvillous Inclusion Disease” about half the time, but
cautioned that for reliable medical use, however, the error rate had to decrease
substantially (Muensterer et al. 2014).
IMU and Infrared Sensor Ishimaru et al. (2014), who argued that the inertial
measurement unit (IMU) and blink detection are the most characteristic features of
the Google Glass platform, studied combining head motion with blink frequency
for activity recognition. For the our research and testing purposes, the IMU and
infrared sensor each provided a separate input for hands-free interaction. We used
the IMU (gyroscope and accelerometer) to measure head tilt and the infrared sensor
to detect double blinks.
6 Conclusion
image capture system in the wound care field. Our initial pilot study revealed that
the system enables hands-free digital photography, tagging, speech-to-text image
annotation, and the transfer of data to an electronic medical record. In its current
implementation, the system leverages Google Glass’ camera and internal sensors to
guide clinicians through the process of taking and annotating wound images.
    Additional applications include augmented point-of-view sharing during oper-
ating room procedures to enhance a clinician’s knowledge base and decision-
making capabilities, while performing surgical interventions. Specifically, since
several individuals are needed to perform a surgical procedure (e.g. the attending
surgeon, assisting surgeons, an anesthesiologist, and multiple nurses), the incision
site is frequently crowded and obstructed from one’s field-of-view. With aug-
mented POV sharing, images projected from an attending surgeon’s vantage point
onto a remote screen provide visual clarity to individuals directly involved in the
surgery, as well as to remote participants and expert advisors. It also helps trainees
and surgical fellows to learn procedures more accurately, by viewing an interven-
tion from the same perspective as an attending surgeon, rather than from the reverse
perspective (e.g. standing on the opposite side of the table)—which is often the case
today. Finally, the use of POV sharing with bi-directional communication capabil-
ities and remote sensor connectivity enables real-time collaboration with a pathol-
ogy lab or expert consultants while performing a surgical intervention. Through
visual overlays, POV sharing can co-locate one’s visual field with information
critical to performing a procedure—such as vital sign information, procedural
descriptions, or MRI scans.
    In the future, we aim to further enhance the robustness and reliability of a mobile
AR system for acute care scenarios in the operating room and emergency room
settings. This involves developing applications with data security and privacy
features that are in compliance with strict hospital security protocols. We are
examining the use of a dynamic digital ruler to measure images in a hands-free
manner, a method of contextual time-lapse image recall using the Glass display to
facilitate nurses in visualizing a wound’s staging and healing progression over time,
and speech-to-text voice recognition capabilities for improved wound annotation
and team-based collaboration.
    In the broader context of distributed collaboration for improved healthcare
delivery, this research aims to examine the use of technology for complex distrib-
uted problem solving through interdisciplinary collaboration; gain an improved
understanding of the benefits of human augmentation through enhanced visualiza-
tion and auditory capabilities, on healthcare team performance; and explore an
AR system’s ability to influence behavior change in situations requiring acute
decision-making through interaction between centralized experts and point-of-
impact delivery personnel.
Design Thinking Health: Telepresence for Remote Teams with Mobile Augmented. . .                65
References
Ahn C, Salcido RS (2008) Advances in wound photography and assessment methods. Adv Skin
    Wound Care 21:85–93
Albrecht UV, von Jan U, Kübler J, Zoeller C et al (2014) Google glass for documentation of
    medical findings: evaluation in forensic medicine. J Med Internet Res 16(2), e53
Aldaz G, Shluzas LA, Pickham D, Eris O, Sadler J, Joshi S, et al. (2015) Hands-Free Image
    Capture, Data Tagging and Transfer Using Google Glass: A Pilot Study for Improved Wound
    Care Management. PLoS ONE 10(4):e0121179. doi: 10.1371/journal.pone.0121179
Beard HR, Hamid KS (2014) Worth a thousand words: integrating clinical photographs into an
    electronic medical record. Healthcare 2(1):22–25
Bickmore TW, Pfiefer LM, Jack BW (2009) Taking time to care: empowering low health literacy
    hospital patients with virtual nurse agents. In: Proceedings of the CHI 2009. ACM Press
Bizzotto N, Sandri A, Lavini F, Dall’oca C, Regis D (2013) Video in operating room: GoPro
    HERO3 camera on surgeon’s head to film operations—a test. Surg Innov. doi: 10.1177/
    1553350613513514
Craig AB (2013) Understanding augmented reality: concepts and applications. Elsevier, Boston,
    MA
Dixon BJ, Daly MJ, Chan H, Vescan AD, Witterick IJ, Irish JC (2013) Surgeons blinded by
    enhanced navigation: the effect of augmented reality on attention. Surg Endosc 27:454–461
Doherty A, Williamson W, Hillsdon M, Hodges S, Foster C, Kelly P (2013) Influencing health-
    related behaviour with wearable cameras: strategies & ethical considerations. In: Proceedings
    of the 4th International SenseCam & Pervasive imaging conference. ACM, New York, NY, pp
    60–67
Epic Systems. http://www.epic.com. Accessed 10 Jan 2014
Glauser W (2013) Doctors among early adopters of Google Glass. Can Med Assoc J 185:1385
Gonzales D, Pickett R (2011) Too much pressure. Stanford Nurse 31(1):12–13
Google + Glass Update. http://plus.google.com/+GoogleGlass/posts/9CdE6ysitvP. Accessed
    21 Jan 2014
Google Glass. http://www.google.com/glass/start. Accessed 5 Aug 2013
Goran SF (2010) A second set of eyes: an introduction to Tele-ICU. Crit Care Nurse 30(4):46–55
Graves N, Birrell F, Whitby M (2005) Effect of pressure ulcers on length of hospital stay. Infect
    Control Hosp Epidemiol 26:293–297
Hayes S, Dodds S (2003) Digital photography in wound care. Nurs Times 99(42):1–3
Ishimaru S, Weppner J, Kunze K, Kise K, Dengel A, Lukowicz P, Bulling A (2014) In the blink of
    an eye: combining head motion and eye blink frequency for activity recognition with Google
    Glass. In: Proceedings of the 5th augmented human international conference. pp 150–153
Krouskop TA, Baker R, Wilson MS (2002) A noncontact wound measurement system. J Rehabil
    Res Dev 39:337–345
Lutz BJ, Kwan N (2014) Chicago surgeon to use Google Glass in operating room. http://www.
    nbcchicago.com/news/tech/google-glass-surgery-237305531.html. Accessed 2 Feb 2014
Lyder CH, Wang Y, Metersky ML et al (2012) Hospital-acquired pressure ulcers: results from the
    national medicare patient safety monitoring system study. J Am Geriatr Soc 60:1603–1608
Magnusson L, Hanson E, Borg M (2004) A literature review study of information and communi-
    cation technology as a support for frail older people living at home and their family carers.
    Technol Disabil 16:223–235
Mattke S, Klautzer L, Mengistu T, Garnett J, Hu J, Wu H (2010) Health and well-being in the
    home: a global analysis of needs, expectations, and priorities for home health care technology.
    RAND Corporation, Santa Monica, CA
MAVIS III. http://www.perrybaromedical.com/accessories-mavis3.html. Accessed 7 Apr 2014
Medical Xpress. First US surgery transmitted live via Google Glass (with video http://
    medicalxpress.com/news/2013-08-surgery-transmitted-google-glass-video.html.           Accessed
    13 Oct 2013
66                                                                           L.A. Shluzas et al.
Abstract Massive online classes are global and diverse. How can we harness this
diversity to improve engagement and learning? Currently, though enrollments are
high, students’ interactions with each other are minimal: most are alone together.
This isolation is particularly disappointing given that a global community is a
major draw of online classes. This paper illustrates the potential of leveraging
geographic diversity in massive online classes. We connect students from around
the world through small-group video discussions. Our peer discussion system,
Talkabout, has connected over 5000 students in 14 online classes. Three studies
with 2670 students from two classes found that globally diverse discussions boost
student performance and engagement: the more geographically diverse the discus-
sion group, the better the students performed on later quizzes. Through this work,
we challenge the view that online classes are useful only when in-person classes are
unavailable. Instead, we demonstrate how diverse online classrooms can create
benefits that are largely unavailable in a traditional classroom.
1 Introduction
which aids learning and growth (Gurin et al. 2002). But, while physical classrooms
often strive to be diverse, they remain limited by physical geography (Kucsera and
Orfield 2014).
    Massive online courses recruit thousands of students from over 100 countries,
bringing together peers with many nationalities and experiences (Olds 2013).
Instructors often advertise how many countries are represented in the class
(Breslow et al. 2013; Konstan et al. 2014; Olds 2013). However, while student
diversity has become a calling card of online education, this potential is currently
un-tapped. Most online students currently see only a glimpse of their peers’ global
diversity, primarily in text discussion forums. This slow-motion communication is
a poor fit for the open-ended dialogue characteristic of dorm hallway conversation
(Jacobs 2013), and can reinforce a one-size-fits-all, broadcast educational approach
(Losh 2014).
    This paper illustrates the potential of leveraging diversity in online classes, and
introduces the Talkabout environment and curricula for small, geographically-
diverse groups in massive classes. Talkabout connects students to their global
peers via guided, synchronous video discussion. Talkabout focuses on harnessing
geographic diversity, where students connect with peers from other parts of the
world. Geographic diversity enables students to access peers with different cultures
(Green 2005), levels of income (Goesling 2001), and beliefs about learning (Purdie
and Hattie 1996).
    Geographically diverse classrooms can improve educational experiences, mak-
ing them deeper and more realistic. Multinational discussions create the oppor-
tunity for what one student called a ‘mini United Nations’, where students experience
first-hand the differing concerns and beliefs of people from different countries.
    Talkabout forms groups of two to nine students from different parts of the world
for a video discussion. Discussion prompts ask peers to relate course content to their
local and personal experiences, encouraging students to reflect on previously
unexamined assumptions about their own environments, and deepening their learn-
ing (Lin and Schwartz 2003). To date, more than 5000 students from 134 countries
have used Talk-about in 14 online classes via Coursera and OpenEdX. This paper
reports results from the first seven courses and 3200 students. These classes
included Social Psychology, Organizational Analysis, Behavioral Economics, and
Logic and Design. Table 1 shows a sampling of topics discussed. The median
discussion had six students from five countries.
    Talkabout’s discussion sessions improved student engagement: students ran-
domly assigned to a Talkabout group were significantly more likely to participate
in class quizzes than those placed on a wait-list for future participation (Wald
z* ¼ 1.96, p ¼ 0.03).
    Geographically diverse discussions yield higher grades and engagement. A
controlled experiment in two massive online classes varied the number of countries
present in Talkabout discussions. Students in more geographically diverse discus-
sions performed significantly better on subsequent quizzes and exams (t(129) ¼
1.78 and t(110) ¼ 2.03, p < 0.05).
Talkabout: Making Distance Matter with Small Groups in Massive Classes                       69
Table 1 Excerpts from discussion agendas from 1 week in different classes. Each question below
included more detailed guidance in the actual discussion
Course title                Representative discussion topics
Critical Perspectives on    How do you define innovation and invention? How do manage
Management                  them? Are shipping containers and labor unions innovations or
                            inventions?
Irrational Behavior         How do you treat money as a relative rather than absolute good? Do
                            you think that it is more painful to pay with cash than credit? How
                            might issues of fairness vary by culture?
Organizational Analysis     Describe your experience in organizations where decisions by
                            organized anarchy occurred. Did they solve anything? How com-
                            mon were they?
Social Psychology           In your country, which forms of prejudice are the most socially
                            acceptable, and which ones are the least acceptable? Why are some
                            forms more acceptable than others?
Think Again                 Since inductive arguments are defeasible, how can it ever be
                            reasonable to trust them? Are arguments from analogy really dif-
                            ferent from inferences to the best explanation?
    Some argue that online education is only desirable when face-to-face education
is unavailable (Freedman 2013). This paper illustrates the benefits of inverting this
proposition: global diversity enables online classrooms to create powerful, previ-
ously unavailable educational experiences and new forms of peer education at scale
that go “beyond being there” (Hollan and Stornetta 1992).
2 Related Work
Kitayama 1991) and holistic versus analytical thinking (Varnum et al. 2010). These
differences impact cognition. For example, when cultures encourage people to
consider objects in relation with their context, they more often apply analogical
thinking. By contrast, when people consider objects in isolation, they more often
apply categorical rules (Varnum et al. 2010).
   To maximize the benefits of diversity, prior work emphasizes two factors: the
numeric representation of diverse groups (structural diversity); and the number of
settings that students interact in (experiential diversity) (Hurtado et al. 1998b).
Ideally, students must meet frequently, and with equal status, in situations where
collaboration is necessary and stereotypes are disconfirmed (Pettigrew 1998), and
where differing views are welcomed (Hewstone 2003).
   Informed by this research, Talkabout forms geographically diverse discussion
groups, and encourages fluid roles and consensus-based decisions that emphasize
equality. Furthermore, Talkabout contributes a curriculum where students can
question stereotypes and compare their views to their peers.
   In most current online classes, students’ opportunities for discussions with
diverse peers are limited to text-based forums. Such asynchronous text channels
inhibit trust-formation (Rocco 1998) and open-ended discussion (Short et al. 1976).
Synchronous channels, such as video, improve participants’ sense of belonging and
willingness to collaborate (Saltarelli 2012). Channels such as video which support
multimodal communication and nonverbal cues are also better suited to ambiguous
discussions (Daft and Lengel 1986) and complex sense-making (DeSanctis
et al. 2003). For these reasons, Talkabout leverages synchronous, small-group
video discussions to encourage meaningful, open-ended dialogue.
   Massive scale presents both a formidable challenge and a powerful opportunity
for online education. Prior work encouraging unstructured discussion failed to find
an improvement in students’ sense of community or academic achievement
(Coetzee et al. 2014). More systematically structured approaches have enjoyed
greater success. One example is the use of rater redundancy and short exercises that
create micro-expertise in peer review: with this structure, peers can provide expert-
quality assessment and feedback (Kulkarni et al. 2013), and act as mentors
(Papadopoulos et al. 2014). Talkabout introduces a structured interaction and
curriculum that leverages diversity.
The Talkabout interface guides instructors through setting up their course discus-
sions, and creating a structured discussion agenda for students (Fig. 2a). This
agenda is displayed throughout the discussion (Fig. 1).
   Students choose a discussion time from the published set (Fig. 2b), up to a week
in advance. As students log in at their selected time, Talkabout assigns them to
groups (instructor can choose group size between 2 and 9). Talkabout has several
policies for group assignment; by default it assigns arriving students to a group until
Talkabout: Making Distance Matter with Small Groups in Massive Classes                         71
Fig. 1 Talkabout provides a structured discussion agenda and enables students from around the
world to discuss with each other
Fig. 2 Talkabout discussion timeline: (a) Instructors enter a discussion agenda, and times for the
discussion. (b) Students pick their preferred time. (c) When they log on to Talkabout at their
selected time, Talkabout assigns them to a group, and creates a private hangout. Students show up
at their selected time, and enter the discussion
it reaches its size limit; then it starts a new group. Other policies, discussed later,
explicitly factor geographic location into group assignment. Discussions occur
through the Google Hangouts platform for multi-person video and audio chat. For
each group, Talkabout creates a discussion session exclusively for the assigned
participants. Discussion groups exist only for the duration of the discussion session.
If students participate in multiple discussion sessions—even in the same course and
on the same topic—they are likely to have different partners, because grouping
depends on students’ arrival order. Consequently, students hear different ideas and
experiences each time.
    During discussions, the Talkabout Hangout application shows the instructor’s
discussion agenda on the left and the video chat on the right. An agenda typically
includes suggested discussion topics or activities (Figs. 1 and 4).
72                                                                              C. Kulkarni et al.
Our early experiences with Talkabout, as well as prior work, suggest that it is
critical to co-design curricular strategies with educational interaction design.
In particular, scripts for discussion have a major impact on student engagement
and learning (O’Donnell and Dansereau 1995). Talkabout succeeds best when discus-
sions create opportunities to highlight students’ diverse experiences. Based on prior
work, we developed three strategies to create discussion scripts or agendas, and
refined them through deployments in seven massive classes. Figure 4 shows these
Fig. 3 Across classes. (a) Students from many countries participate in each six-person discussion.
(b) These students aren’t just from neighboring countries, they are globally distributed
Talkabout: Making Distance Matter with Small Groups in Massive Classes                    73
Self-reference, when students actively relate class content to their own experiences
and perspectives, increases concept elaboration, memory organization, and knowl-
edge retention (Symons and Johnson 1997). Talkabout agendas that employ self-
reference ask students to share personal examples that embody class concepts. Self-
reference is especially effective when students feel safe in discussing personal
experiences. Talkabout groups are small by design to encourage self-disclosure
(Nguyen et al. 2012). As each person shares with the group, it encourages peers to
likewise disclose (Joinson 2001).
   The globally distributed nature of discussions amplifies the benefits of sharing
self-referential frames. After a discussion on prejudice in Social Psychology, one
student wrote, “I think this may have been the first time the lady from Saudi Arabia
had spoken to a Jew [referring to himself]”, showing her a different viewpoint. He
added, “I told her about the prejudice from Christians I experienced growing up in
[US state] in the 1940s and the effect of segregation on blacks,” reflecting on his
own experience.
   Students may see different self-referential frames with different groups.
For instance, even though Social Psychology had only one Talkabout discussion
(with multiple slots), 454 out of 2553 participants in the Social Psychology class
voluntarily attended multiple timeslots.
74                                                                     C. Kulkarni et al.
When a prompt says less, students sometimes say more. Rather than reviewing
every relevant concept, Talkabout discussion agendas reference concepts from
class without any reminders of what they mean. These underspecified references
lead students who have learned these concepts to elaborate, and to act as mediators
with students who would have otherwise not understood them. This is similar to
highly effective offline strategies like jigsaw classrooms, which also rely on peer-
mediated learning and contact with dissimilar peers (Aronson and Bridgeman
2004).
   Creating opportunities for mediation also encourages students to ask about other
class concepts they haven’t understood. For instance, the Organizational Analysis
class used “white flight” (a large-scale migration of white Americans to suburbs
in the 1950s) as an example of an organizational problem faced by cities.
In one Talkabout discussion session, we observed an American student translate
the key ideas in this example to a European classmate by making an analogy to
intra-European migration.
Talkabout discussion sessions usually began with introductions. Since none of the
participants knew each other, introductions were fairly formal and detailed. Partici-
pants typically shared their first name, their country of residence, and a brief
description of their job. Because some participants arrived late to their session,
this introduction phase was often repeated.
   During these introductions, an informal moderator usually emerged. Moderators
often had experience with video-conferencing and a high-bandwidth connection.
They exhibited leadership behaviors such as asking participants to introduce them-
selves, or even explicitly asking to moderate the conversation (e.g. “Shall I lead the
conversation?”)
   After introductions, the informal moderator drew the group’s attention to the
instructor-provided discussion agenda. Even though agendas sometimes suggested
a particular discussion order, participants did not follow it exactly. Instead, they
would interpret the agenda for the major theme it embodied, and negotiate what
they discussed first. Once students finished discussing a particular prompt, they
returned to the agenda to decide the next topic.
   While Talkabout discussion sessions were designed to last 30 min, the median
length of the discussion was 58 min (Fig. 5). With these longer discussions, students
discussed topics that were marked optional, or chose to discuss two topics when the
agenda asked only one etc. Many groups also spoke about the class in general after
the assigned topics. Conversations typically ended soon after the informal moder-
ator (or a talkative speaker) left the discussion, or when no one in the group
suggested a topic to discuss next. As they left, participants often shared how they
enjoyed talking to the group, or taking the class. Moderators sometimes encouraged
the group to stay in touch after the discussion (e.g. “With the other hangouts, we all
added each other on LinkedIn. . . I’ve already added [name]. If you’d like, feel free
to add me.”)
It is not obvious that the benefits of peer discussions (Brookfield and Preskill 2012;
Parker 2000) would transfer to an online environment. In these environments, peers
have vastly different backgrounds and no prior interaction with each other. There-
fore, our first study measures the benefits of participation in online discussions.
Later experiments measure how these benefits vary with geographic diversity in
discussion groups.
    With many educational practices, it is difficult to draw a causal link between
participation and student learning. For instance, students may self-select to partici-
pate. To combat this bias, we use a control condition in which interested students
are actively prevented from discussing. Furthermore, we use an intention-to-treat
analysis that recognizes that some students will not participate, even when given the
opportunity. Therefore, this analysis asks: after controlling for students that don’t
discuss given an opportunity, are discussions effective? Such analysis is common in
clinical trials, where patients that are randomly assigned to a treatment group are
included in the analysis even if they do not take their medication. Because
intention-to-treat analyses take non-compliance into account, they result in conser-
vative estimates of a drug’s effectiveness.
Talkabout: Making Distance Matter with Small Groups in Massive Classes               77
7.3 Participants
Overall, 1002 students were assigned to the Discussion condition, and 122 to the
Wait-list condition. We used an unbalanced design to maximize the number of
students who benefited from discussions. Of those in the discussion condition,
397 attended a discussion.
78                                                                          C. Kulkarni et al.
Students in the Discussion condition were more likely to take the quiz. A logistic
regression indicated that odds of taking the quiz were 1.46 times higher for the
Discussion condition (Wald z* ¼ 1.97, p < 0.05). Students in the Discussion con-
dition also did marginally better on the quiz (t(1122) ¼ 1.89, p ¼ 0.06).1
The average improvement was 16.7 %.
    Thus, even accounting for students who do not follow through, discussions help
students stay engaged in the course and perform better on related assessments.
    While Talkabout participation improves engagement, this effect seems short-
lived. Students who participate in a Talkabout 1 week are not more likely to
participate in the quiz the following week: Wald z* ¼ 1.61, p ¼ 0.10. We also
found no significant improvement in quiz scores for the quiz due the
following week.
    Would participating in multiple Talkabout discussion sessions improve these
short-term benefits? As is typical with online classes, many students shopped the
first weeks, and only 113 students in the discussion condition attended the second
discussion (397 attended the first week). Therefore, our intention-to-treat analysis
lacks the statistical power to capture any benefits of participating in multiple
discussions. Also, while the wait-list design can control for intent to participate,
students who actually participate in discussions may still differ from those who
don’t (e.g. they could be more motivated). An intention-to-treat analysis estimates
effects by assuming participants’ distribution (e.g., for motivation) are similar in the
wait-list and treatment groups due to randomized assignment, but this experiment
does not verify this assumption.
    The results of this study suggest that performance on class quizzes may improve
even with limited participation, and that discussions improve student engagement.
Do these effects depend on the participants in the discussion? Given our hypothesis
that geographic diversity should help learning, our next study investigates the effect
of discussants’ geographic diversity on course performance.
1
  While only marginally significant (p < 0.10), we include this result because it is suggests
opportunities for future work.
Talkabout: Making Distance Matter with Small Groups in Massive Classes             79
North America, Western Europe and the South Pacific. The Same-region condition
grouped students with others from their region. The Multi-region condition grouped
students from anywhere in the world. We discarded data from the South Pacific
region because it had few participants.
8.2 Measures
8.3 Hypothesis
The median number of countries in the same-region condition was two (both from
the same geographical region), while the median in the multiple-region condition
was four.
   Does large geographical distance imply a diverse group? Some World Bank
regions are large, so we examined if multi-region groups had more differing
national viewpoints than same-region groups, taking into account how economic
opportunities and educational experience influence everyday experience (Desai
1991), as do cultural values (Markus and Kitayama 1991).
   We used each participant’s country to map them onto diversity attributes used in
cultural psychology and political science. We use countries as our unit of analysis
because they have a consistent typology of collectivistic or individualistic culture
(Green 2005), organizational attitudes such as inter-personal dependence and
criteria for fulfillment (Shenkar and Oded 1985), economic development (Goesling
2001) and life expectancy (Marmot 2005). While each country is diverse, within-
country differences are smaller than between-country differences (Green 2005),
making this by-country analysis feasible.
   We compared countries of participating students on three dimensions: cultural
values, income, and pupil-teacher ratios in primary school. As a measure of cultural
values, we used the mean overall secular values for each country from the World
Values Survey (2014). Countries with lower scores have societies that emphasize
religion, traditional family values, and collectivistic thinking. The average pair-
wise difference between participants’ countries on the overall secular values scale
was lower in the same-region condition than in the multi-region condition,
Wilcoxon W ¼ 407.5, p < 0.05 (same-region mean: 0.022, equivalent to the differ-
ence between the US and Romania, multi-region mean: 0.031, equivalent differ-
ence: US and Thailand).
   Students’ countries in the Multi-region condition had marginally higher differ-
ences in income levels compared to those in the Same-region condition (t(74) ¼
1.81, p ¼ 0.07; log-transformed because income distribution is log-normal (Cle-
menti and Gallegati 2005)). Using data from the World Bank (World Bank Group
2012), the median per-capita annual income differed on average by $8120 (PPP) in
the same-region condition, approximately the difference between the US and
Canada. The average difference in the multi-region condition was $20,495 (PPP),
approximately the difference between the US and Israel.
   Lastly, students’ countries in the multi-region condition had greater pairwise
variation in educational experience, as reflected in primary school pupil-teacher
ratios (t(74) ¼ 2.00, p < 0.05). Using World Bank data (World Bank Group 2012),
the median differences in the pupil-teacher ratios in the same-region condition were
2.91 (approximately the difference between schools in the US and Canada), while
the median difference in the multi-group condition was 5.91 (the difference in
schools between the US and Russia).
   Collectively, these analyses suggest that multi-region groups brought more
diverse experiences and backgrounds to their discussions.
Talkabout: Making Distance Matter with Small Groups in Massive Classes           81
In Study 3, we sought to confirm and expand upon Study 2’s diversity effect across
more classes and with more students. In doing so, we trade off some of Study 2’s
experimental control in exchange for a much larger sample. We conducted our
experiment across two large online classes, Organizational Analysis and Social
Psychology.
9.1 Participants
discussants received the same discussion agenda and directions as online discus-
sants. Two thousand and thirty-seven students reported participating in an in-person
discussion. Except for qualitative comparisons between online and in-person dis-
cussions, we ignore their data. It is possible that online discussions attracted
students who believed they would benefit more from a diverse discussion. How-
ever, the main results of this study were consistent across both classes.
9.2 Method
designed), and they were used unchanged in the experimental class. The first
Talkabout session was 1 day before the first quiz was due. We analyze the first
2 weeks’ quizzes. The first quiz had 19 multiple-choice questions; the second had
16 (see Appendix 2). Finally, both classes invited students to participate in a post-
discussion survey about their experience.
9.4 Results
Our analysis finds support for the first hypothesis: students perform better on tests
after a more geographically diverse discussion. We find no support for our second
hypothesis that diverse discussion improves retention in the long-term.
In both classes, more diverse discussions led to higher exam grades (Table 2). In
Social Psychology, on the final exam out of 50 points, each additional country adds
an approximate β ¼ 1.78 points (2.4 % of the final grade) to a student’s final exam
score (t(129) ¼ 1.78, p ¼ 0.01). In Organizational Analysis, on the Week 2 quiz out
of 16 points, each additional country yields β ¼ 0.39 points (3.6 %) to the quiz score
(t(110) ¼ 2.03, p < 0.05). However, from the model for the Week 1 quiz (without a
prior-performance measure), we do not see any significant effect of diversity on
score. Prior performance helps capture sufficient variation to make diversity stati-
stically distinguishable from a null hypothesis.
84                                                                            C. Kulkarni et al.
In the Organizational Analysis class, while geographic diversity leads to better quiz
scores 1 week after discussion (Week 2 quiz), we did not find any significant effects
into Week 3. Similarly, we built an ordinary-least-squares linear model for
predicting how many weekly quizzes a student would participate in, based on the
number of countries in their first discussion. We found no significant effect
(t(130) ¼ 0.49, R2 < 0.001). Similar to results from Study 2, this suggests that
the benefits of a diverse discussion only persist for a short duration.
Post-discussion, a survey asked participants about the best part of their discussion.
Two independent raters coded 100 responses about whether comments mentioned
participant diversity: 51 % mentioned it (Cohen’s κ ¼ 0.7, z ¼ 7.04, p < 0.001).
Students noted that diversity yielded different experiences and examples and
perspectives, which challenged ones held by students. A Social Psychology student
wrote how they learned that “. . .in China it is a custom for married women to keep
their surnames, thus I [now] think women changing their surnames when married in
other countries has something to do with sexism.” An Organizational Analysis
student said, “It was interesting to hear about organizations in Australia, Ukraine,
Israel, Indonesia, and Canada. Similar issues appear everywhere regarding deci-
sion-making.”
affect performance after controlling for the number of countries in each group.
Adding the proportion of female participants to the Organizational Analysis class
model for the Week 2 quiz did not improve model fit, and the effect of gender was
not significant: t(100) ¼ 1.1, p ¼ 0.26. The Social Psychology class shows a similar
non-significant effect: t(128) ¼ 0.62, p ¼ 0.53.
We test the following variables in isolation; all were non-significant with p > 0.50.
We found no significant effect of the arrival order of participants on either the
diversity in their group, or the benefits of diversity on course grades. We also find
no evidence that diverse discussions had larger benefits for either gender. Finally,
there was no significant correlation between how early students signed up for a
discussion and their benefits.
The results of our analysis were consistent when we used other measures such as the
pairwise distance between participants’ locations. We use the number of countries
while describing results because it is more interpretable.
9.5 Limitations
This experiment included two classes, Social Psychology and Organizational Ana-
lysis. Both classes used Talkabout in discussions focused on critical thinking and
sense-making. As such, evidence that geographically diverse discussions improve
engagement and learning may not generalize to classes that emphasize procedural
knowledge (e.g. Corporate Finance), or classes where benefits from global per-
spectives are smaller (e.g. physics). That said, even the most procedural topics
require critical thinking and judgment, and as many instructors have found, topics
like physics that seemingly don’t benefit from global perspectives may still benefit
from discussions (Crouch and Mazur 2001; Mazur 2009).
   Geographic diversity encodes many other kinds of diversity, e.g., economic
opportunities, cultural values, and education experience. Each of these dimensions
may have differing benefits for online classes. Future work could build theory that
differences matter when.
86                                                                               C. Kulkarni et al.
10 Discussion
   Video-based discussions are not without their problems today. Some countries
(e.g., Iran) restrict access to Google Hangouts, low-bandwidth connections degrade
the student experience, and installing video-conferencing software remains chal-
lenging for some students. However, these technological limitations are likely to
lessen as bandwidth becomes more plentiful and software comes pre-installed.
Recall that Social Psychology allowed students to choose to run their discussion in
person instead of online. Students participating in the in-person discussions often
turned to close friends and relatives. The shared context made the conversation
friendlier. For instance, one participant re-marked, “I really like the discussion
because it was with my friends. . . It was really easy to start the discussion.”
In-person discussions also had lower geographic diversity. One student summar-
ized, “Being from the same age group, social level and from the same community;
we had very much similar views about the topics in hand.” Students reported
difficulties scheduling discussions and keeping them on-topic. One remarked,
“We had to reschedule a couple of times [before we could meet].” And with friends,
“Turning a conversation towards a scientific discipline such as social psychology
was hard and a bit artificial. . .” An-other remarked, “Members were my family. . .
and speaking about some things is not easy!”
Table 4 Talkabout’s current implementation emphasized in italics, design choices that we found
to be worse are emphasized in bold
Design dimension            Choices
Same discussants every      Yes                 No                        When possible
time?
Group size                  Small                                         Large
Discussion guidance         None                Guidelines and            Scripts
                                                prompts
Role negotiation            Instructor          Technologically           Informal
                            specifies           mediated
Discussion scheduling       On-demand any       Bus stop: regular         Same time every
                            time                intervals                 week
Prior work in CSCL suggests that structuring collaboration between students using
instructions or scripts yields improved learning (O’Donnell and Dansereau 1995).
What is the right degree of scripting? In a between-subjects experiment (n ¼ 82) in
the Organizational Analysis class, we explored the benefit of an enforced script. In
this condition, Talkabout only showed the current discussion topic, and participants
needed to click a button to indicate completion and advance to the next topic.
The control condition agenda showed all topics at once.
   Of the 50 students in the enforced-script condition, only four clicked the “next
topic” button even once. In the post-survey, students also reported they felt the
discussion was less motivating (Mann–Whitney U ¼ 193, p ¼ 0.07), and that they
were less willing to meet the same group again (U ¼ 191.5, p ¼ 0.08). This suggests
Talkabout: Making Distance Matter with Small Groups in Massive Classes                   89
that enforcing a discussion order may undermine the social benefits of Talkabout
(Dillenbourg 2002).
11 Conclusion
This paper suggests that the geographic diversity in online classes can be an
educational asset. Instead of becoming a handicap, distance can expose students
to others and to other ways of thinking. However, leveraging the diversity of online
environments requires careful design. This paper describes one such approach,
Talkabout, which uses video chat to create discussions between students across
the world. Embracing and designing for diversity can enable other innovations.
For instance, instructors could leverage students as co-creators and draw on students’
local observations to showcase how course concepts arise differently around the
world. Likewise, international relations or security courses might launch a
global crisis simulation with each student representing their own region. These
educational experiences offer a glimpse of the potential of thinking “beyond being
there” (Hollan and Stornetta 1992). They are not just leveraging geographic diver-
sity—they would be impossible without it.
their classes, and the many students who connected with their peers and shared their stories with
us. This research was funded in part through NSF grants #1351131 and #1444865, the Hasso
Plattner Design Thinking Program, and the Siebel Scholars Program. This research was conducted
under Stanford IRB protocol #30319.
References
Anderson A et al (2014) Engaging with massive online courses. In: Proceedings of the
   23rd international conference on world wide web, ACM
Aronson E, Bridgeman D (2004) Jigsaw groups and the desegregated classroom: in pursuit of
   common goals. In: Aronson E (ed) Readings about the social animal. Worth Publishers,
   New York, NY, p 532
Bos N, et al. (2002) Effects of four computer-mediated communications channels on trust develop-
   ment. In: Proceedings of the CHI: ACM conference on human factors in computing systems.
   ACM Press
Braskamp LA et al (2008) Assessing progress in global learning and development of students with
   education abroad experiences. Front Interdiscip J Study Abroad 18:101–118
Breslow LB et al (2013) Studying learning in the worldwide classroom: research into edX’s
   first MOOC. Res Pract Assess 8:13–25
Brookfield SD, Preskill S (2012) Discussion as a way of teaching: tools and techniques for
   democratic classrooms. Wiley, San Francisco
Clementi F, Gallegati M (2005) Econophysics of wealth distributions. In: Chatterjee A et al (eds)
   Econophysics of wealth distributions. Springer Milan, Milano
Coetzee D et al (2014) Chatrooms in MOOCs: all talk and no action. In: Proceedings of the
   ACM conference on Learning @ scale. ACM Press, pp 127–136
Coser RL (1975) The complexity of roles as a seedbed of individual autonomy. In: Coser LA (ed)
   The idea of social structure: essays in Honor of Robert Merton. Harcourt Brace Jovanovich,
   New York
Crouch CH, Mazur E (2001) Peer instruction: ten years of experience and results. Am J Phys 69(9):
   970
Daft RL, Lengel RH (1986) Organizational information requirements, media richness and struc-
   tural design. Manage Sci 32(5):554–571
Desai M (1991) Human development: concepts and measurement. Eur Econ Rev 35(2–3):350–357
DeSanctis G et al (2003) Learning in online forums. Eur Manage J 21(5):565–577
Dillenbourg P (2002) Over-scripting CSCL: the risks of blending collaborative learning with
   instructional design. In: Kirschner PA (ed) Three worlds of CSCL. Can we support CSCL?
   Open Universiteit Nederland, Heerlen, pp 61–91
Freedman J (2013) MOOCs are usefully middlebrow. The chronicle of higher education.
   http://chronicle.com/article/MOOCs-Are-Usefully-Middlebrow/143183/
Goesling B (2001) Changing income inequalities within and between nations: new evidence.
   Am Sociol Rev 66(5):745–761
Green EGT (2005) Variation of individualism and collectivism within and between 20 countries:
   a typological analysis. J Cross-Cult Psychol 36(3):321–339
Gurin P et al (2002) Diversity and higher education: theory and impact on educational outcomes.
   Harv Educ Rev 72(3):330–367
Guzzetti BJ et al (1993) Promoting conceptual change in science: a comparative meta-analysis of
   instructional interventions from reading education and science education. Read Res Q 28(2):
   116–159
Hambrick DC et al (1998) When groups consist of multiple nationalities: towards a new under-
   standing of the implications. Organ Stud 19(2):181–205
Talkabout: Making Distance Matter with Small Groups in Massive Classes                           91
1 Introduction
   Improv. . .allows the designer to explore the design solution in all sensory and cognitive
   modalities. . .in a way that cannot be achieved through mere graphic design or well-plotted
   “user scenarios.” While the actor uses empathy to perform dramatic characters in scripted
   situations, the designer uses empathy to perform design solutions that are drawn from deep
   identification with real, individual people in specific situated contexts in the real world.
       -Brenda Laurel
       Design Research: Methods and Perspectives Laurel (2003)
   Our current work explores, how design engineers can draw out people’s implicit
intuitions and expectations for how interactions with machines should transpire. In
this paper, we focus on our use of semi-structured, improvised interaction sessions
in order to prototype devices and platforms, experiment with prompts and
responses, and test social norms.
   The approach evolved from our research in physical interaction design and robot
mediated communication: domains that rely on designers’ knowledge and intuition
of embodied action, which are often only implicitly understood, and therefore can
be difficult to verbalize. Our use of collaborative improvisation has therefore been
particularly helpful in revealing, and making explicit, these internal thought
processes.
Our case studies center on the design of interactions between people and expressive
everyday objects: specifically, how non-anthropomorphic robots can, and should,
interact with people. So what are robotic everyday objects? Our projects include an
ottoman that offers to support a seated person’s feet, a dresser drawers unit that
opens to reveal the right tool just when someone needs it, and a trash barrel that
roams around a dining area and collects trash from diners completing their meals.
We study such objects in motion because interactivity implies sociability, and the
mindful design of such near-future autonomous technologies can help avoid social
miscues, mismatched conceptual models, and unmet expectations. Over the course
of these projects, our goals have been (a) to understand how people respond to
novel, agentic devices during their everyday activities, as well as (b) to develop a
methodology for creating interactions that read naturally to the people involved.
This type of design research takes place within the broader context of embedded
computing, smart devices and home automation.
   The following sections describe how we designed these three prototypes, and
used improvisation to evaluate how best to interact with people, as well as the range
of responses these devices received in use. We seek to understand both normative
behaviors, which are common and expected, and individual responses, which are
unique and often more idiosyncratic.
96                                                                     D. Sirkin et al.
Fig. 1 A participant in an
improvisation session
accepts the ottoman’s offer
to support his feet
The mechanical ottoman is a household robot that approaches a seated person from
across a room, offers to support his or her feet, and after doing so for several
minutes, requests to take leave of the ongoing interaction. Since the main joint
activity in human-ottoman interaction is fairly static, we were primarily interested
in the question of how engagement and disengagement occur. We therefore devel-
oped the ottoman, and a natural interaction scenario (see Fig. 1), specifically to
study how a non-anthropomorphic robot can initiate, participate in, and then
disengage from, a joint activity with a human partner (Sirkin et al. 2015).
There were two distinct phases of design improvisation for the ottoman: an initial,
developmental phase using the hand-puppeted prototype, and a subsequent exper-
imental phase using the remotely-controlled robotic prototype.
We held three design sessions, each lasting about 2 h, with domain experts in
physical movement and interaction, including (a) a dance choreography instructor,
(b) an improvisational theater performer and theater director, and (c) a stage actor.
We placed these participants in various individual seating arrangements, and
puppeted the prototype (see Fig. 2), exploring ways that someone could beckon
or dismiss the ottoman, what personalities different speeds, gestures or angles of
approach and departure projected, and appropriate social distances. We engaged
participants in role play, using prompts such as “shoo the ottoman away as many
ways as you can,” and encouraged them to respond gesturally and speak their
reactions aloud.
   These sessions often pointed out how social and cultural the interpretations of
actions were. At one point, after the robot quickly approached to about a meter
away, followed by a pause, then a gentle move closer, the dance instructor declared
“Ah, now we’re in India,” evoking the tradition of exemplary service being viewed
as an art form in that culture. She elaborated by contrasting India, or England, to
other cultures, such as the United States, which have historical frames of service
being viewed as closer to servitude. The improvisational theater performer echoed
this sentiment when the ottoman withdrew to a ready position in front and to his
98                                                                       D. Sirkin et al.
Fig. 2 During an
improvisation session, the
researcher on the right
puppets the prototype using
two wooden dowels
side, stating “That feels like butlery,” noting how this action showed an intent to be
useful, and allowed it to be recalled to service quickly. The stage actor treated the
ottoman more like an animate object, which he could wave over if needed, sneer at
if he wanted to cross his legs, or kick gently aside if it encroached too closely into
his personal space. Each of these perspectives placed the seated person at the top of
the social order, with the ottoman alternatively treated as provider of expert service,
obedience or pure functionality.
    Several participants felt that a brisk, vertical lift movement of several centime-
ters (which we called “stand up”) suggested attention and a readiness to move, and
that the corresponding drop (or “sit down”) movement suggested stability, and a
likelihood of staying put after completing some action.
3.1.4.3 Approaching
Fig. 3 A typical path taken to approach a seated person. Squares represent the ottoman’s position,
one second apart
Fig. 4 From left to right: one participant anticipates the arriving ottoman by elevating his feet;
another pats it to assuage its offer to support her feet; one rests his feet on it while watching a
video; and another avoids interacting with nervous laughter
held their legs suspended in the air for several seconds, so the ottoman could settle
down just beneath them. Those who did not rest their feet knew that it was an
ottoman, and that it was offering to support their feet, and described their reluctance
as perceiving the ottoman to be alive: “It’s a moving thing that I almost perceive as
living. I didn’t want to denigrate it by using it as a footstool.” Another said,
“Because it felt like it was alive I didn’t want to put my feet on it.” and “I feel
like it communicated with me well, but I would feel uncomfortable doing what it was
asking.” One other participant rested her feet at first, but removed them after about
30 s, saying later, “It seemed like it wanted me to put my feet on it, but I didn’t want
to constrain it too much. I didn’t want to imprison it here.”
Everyone who had rested their feet recognized the quick lift-and-drop motion as a
request to take leave of the interaction, although some did not notice it until the
second or third time: “It, like, sat up when it was ready to go.” and “When it
signaled to leave, it rose and fell, to let me know it was time to go. I wasn’t ready for
100                                                                      D. Sirkin et al.
it to leave because I was comfortable. I’m glad it let me know instead of taking off.”
Responses to the ottoman’s request to withdraw ranged from accepting, “I think it
wanted to go, so I set it free.” to disappointed, “It made its own decision to leave,
like I don’t want to be your footrest anymore.” to mildly annoyed, “I was a little
offput when it decided that it wanted to leave. If it was doing that all of the time,
then I’m not sure how good of a footrest it would be. I expect a footrest to be there.”
    Participants had several rationales to explain its early departure, which ranged
from attending to other routine tasks to taking care of someone else: “I thought it
probably had something to do, to go do some errands.” “Maybe someone else
needed a footrest, or it needed to charge itself.” and “There was probably some-
body more important in the room, so it was going to meet that guy.”
3.2.1 Introduction
The emotive robotic drawers are designed to help us explore how a robot can
participate in an iterative, turn-taking activity with a human collaborator, using its
movements alone. The shared activity can be any task where the human needs
repeated access to objects stored within the robot’s drawers, and where it can open
and close in ways that encourage and support task completion. The example that we
explored most closely is a cube assembly puzzle (see Fig. 5), which requires six
different fasteners, and where the drawers contain all of the tools required to build
the cube Mok et al. (2014, 2015).
We fabricated the prototype from a standard IKEA MICKE 4-drawer unit. To allow
the drawers to perform consistent and repeatable motions, we retrofitted the top
three drawers with motors and a rack-and-pinion system. Spring-loaded rotary
encoders mounted against the unit’s frame allowed us to track each drawer’s
position. The bottom drawer was taped off from access during the study and
contained parts and hardware that drove the system. Actuation was controlled by
an Arduino microcontroller communicating with a local client program over USB
cable.
   The local client program provided a Wizard-of-Oz style remote control for the
drawers. A researcher in an adjacent room operated the drawers via keyboard
hotkey with 15 buttons, each of which was bound to a pre-programmed sequence
of movements, or “animations,” for the drawers to execute. The researcher had a
one-way video feed of both the drawers and participants, which was used to observe
participants’ actions and improvise appropriate responses.
Fig. 6 Drawer animations include (a) simple open, (b) simple close, (c) flair close, (d) wiggle, (e)
beckon, (f) chuckle and (g) happy
102                                                                       D. Sirkin et al.
   The expressive case added flair to the closing animation, and more communi-
cative gestures. For close flair (Fig. 6c), after one drawer closed, the other two
would open and close slightly in sequence, mimicking a ripple effect. Other
animations were designed to suggest to the participant that he or she return or
remove an item from an open drawer. For the former, the drawers would wiggle
(Fig. 6d), where an open drawer moved in and out slightly, but quickly, two times.
For the latter, the drawers would beckon (Fig. 6e), where an open drawer closed
halfway, and then reopened at half speed. Two final animations suggested positive
sentiment. For chuckle (Fig. 6f), all of the drawers opened to random positions,
wiggled twice and then closed. For happy (Fig. 6g), all of the drawers mimicked a
ripple effect, which traveled down the drawers unit twice.
When the drawers were proactive, participants felt that the robot was not treating
them as equals. To one, the robot appeared very much to be “Like a boss.” figure
that relegated people to a lower social standing. Another indicated that “I was the
builder, the drawers should not command me to do things. I will do it when I am
ready.” Yet another felt that “It was distracting when I was trying to understand
what’s going on. I knew it was trying to get me to get the tool for the next step, but I
didn’t know what to do yet. So I just ignored it until I was ready.” Conversely, when
the drawers were expressive, they did not create this feeling of inequality. One
participant noted, “It was like a fiery little Scottish Terrier trying to pull me its
way.” Several others similarly noted that it was “Like a pet.” So, by incorporating
an expressive nature into its actions, the robot can still help lead interactions, while
making participants feel more like equals, and assuaging the frustration and dis-
comfort that they might otherwise feel.
Improving Design Thinking Through Collaborative Improvisation                      103
3.3.1 Introduction
We built and tested a roving trash barrel robot to better understand the implicit
social protocols, cues and signals of public interactions between people and robots,
and to produce movements and actions that people can understand. In eight lunch-
time sessions, spread across two heavily populated campus dining destinations, we
piloted the robot in Wizard-of-Oz fashion through crowded public areas, initiating
and responding to requests for impromptu interactions around collecting people’s
trash (see Fig. 7).
The trash barrel robot was designed to resemble those commonly found around
the university’s campus. Its body is a standard 32 gal BRUTE gray trash barrel
from Rubbermaid’s line of commercial products (see Fig. 8). The barrel is
mounted atop a robotic base, powered by an iRobot Create, and augmented
with a laptop computer (concealed within the trash barrel), two web cameras
(hidden within the barrel’s grab handles) and a microphone (mounted under the
barrel’s topmost lip).
    A researcher remotely controls the robot (Fong and Thorpe 2001), issuing
commands over a web interface to a remote server, which relays the control
commands over WiFi to the laptop, and then by USB to the robotic base (see
Fig. 9). The interface shows the operator two video streams from the cameras in the
robot’s handles. The choice of teleoperation over autonomous control provides
flexibility for real-time improvisation, and permits responsive behaviors to unan-
ticipated events.
Fig. 8 A standard Rubbermaid Brute trash barrel is mounted atop an iRobot Create platform. A
laptop computer, hidden within the trash barrel, handles video and control commands
Fig. 9 The roving trash barrel robot’s control system. Cameras on the robot send video to a remote
operator’s interface on a laptop, which sends motion commands back to the robot
Improving Design Thinking Through Collaborative Improvisation                     105
Owing to its iRobot Create drivetrain, the trash barrel could move forward and
backward, straight or along an arc, or rotate in place: it was incapable of lateral
motion without first rotating. This limitation, combined with the use of cameras to
view and interact with people, created an implicit front for the otherwise cylindrical
trash barrel, and meant that the robot “faced” a person or group as it approach them.
   Aside from speed and direction control, the robot had three pre-programmed
behaviors: wiggling, where it quickly rotated left and right, nudging, where it
abruptly moved front and back, and beeping, where it played a neutral two tone
beep. At high speed, the wiggling and nudging movements appeared more like
shivering and nodding, respectively. The combination of basic drive and
programmed behaviors allowed the operator to maneuver the trash barrel, and
signal intent, improving the legibility and predictability of its actions (Takayama
et al. 2011; Dragan et al. 2013).
We conducted eight sessions, during lunch at two busy dining locations, in which
we explored different ways to approach, gesture, disengage and acknowledge
people. Because our goal was to elicit naturalistic behavior, we did not pre-warn
potential participants that they could be engaged in an improvisational interaction,
but we did interview those who actively engaged with the robot afterward (155 in
total). Due to the crowd and noise, people were rarely aware of the trash barrel until
it approached within an arm’s length distance. It could therefore wander throughout
the area, interacting opportunistically with nearby people along its path, without
their anticipating its arrival.
    There were two phases of improvisation. In the first, exploratory phase, the trash
barrel wiggled, nudged, bumped and beeped to attract attention. It could even
interrupt conversations and make excessive noise by dragging along empty chairs.
In the second, goal-oriented phase, the trash barrel followed a loose script to
encourage people to discard their trash. Scripts called for the robot to either
(a) visit every table along a set path and enact a simple stop-and-move-on behavior
at each one, (b) initiate and respond to requests to engage people by responsive
movement, or (c) purposely appear to struggle, by bumping into chairs or uneven
pavement, with the intent to elicit empathy.
People who were alone were less likely to engage socially with the robot, but if
several people in a group noticed it, they would engage extensively. Overall, people
did not readily interrupt ongoing activities, whether they were watching a video
106                                                                         D. Sirkin et al.
alone or socializing with friends. This correlates with results by Hüttenrauch and
Hüttenrauch and Severinson-Eklundh (2003), who found that people do not attend
to robots when they are already engaged in their own activities.
Most people appreciated the robot’s presence and offer, particularly when they were
in small social groups, where the person closest to the robot could check the responses
of the others before deciding that it was okay to engage. The most common way for
people to signal their unwillingness to interact was to ignore the robot’s presence
(Fischer et al. 2014), although some people overtly avoided interacting by averting
their gaze or turning their faces away, and not responding to the robot’s actions as it
drew near. One woman kept a hand by her face, as if to hold her hair back from her
eyes, but always at an angle that kept her from visually connecting with the robot.
Others avoided looking toward the robot as it approached, but turned around and
recorded video as it departed to interact with others. Still others performed short, curt
interactions, and then pointedly turned away to indicate that the engagement was over.
More observations from this study have been written up in Fischer et al. 2015.
People often waved to attract, or shooed to dismiss, the robot. The most common
gesture to call it over was to wave trash in its line of sight. This pattern is consistent
with the material signals that people use when coordinating joint action, as pointed
out by Clark (2005). But we also observed actions that went beyond just signaling
for coordination. People appeared to ascribe desires and motivations to the trash
barrel, that it intrinsically desires trash rather than just performs a fixed collection
routine. One demonstration of this belief involved a family with a 5-year-old son.
Over 30 min, as the robot interacted with others, the child re-approached the trash
barrel on four occasions, waving trash directly in front of its cameras before slowly
pulling the trash back, in an attempt to coax the trash barrel to follow him, as if the
boy’s mental model of the robot included its intent to consume trash, despite his
never having been informed as such. Adults also seemed to hold this mental model.
Several described throwing away trash as “feeding” the trash barrel, and expressed
a desire for it to acknowledge, or thank, them for having given trash to it.
Both accidentally and intentionally, the trash barrel made a series of mistakes and
exhibited struggling behavior—where it repeatedly tried to reach an objective, but
was unable to overcome obstacles, such as furniture or pavement edges. Some
people felt that the robot was “Not very smart.” and exhibited “Erratic behavior.”
Improving Design Thinking Through Collaborative Improvisation                                 107
But others found struggling to be endearing, characterizing the robot as being like
“A puppy or a toddler.” One said, “When it ran into the garbage cans, I thought,
‘silly robot!’ It was adorable.” This attitude appeared to encourage them to move
obstacles, such as chairs, out of the robot’s way. When asked later why she had
done so, one person replied, “I don’t know, it felt like it was a team effort.” Another
person “Noticed that the robot made people laugh and smile, and when it got stuck,
I wanted to help it because I thought, ‘the show must go on.’”
4 Next Steps
The subject of our embodied design improvisation approach has been to build
interactive robots that co-inhabit spaces with people. As a next step in extending
this method, and in order to design gestures and movements that are more mean-
ingful and socially appropriate, we propose to utilize the “wisdom of crowds” to
help teleoperate robots. We are interested in better understanding when the crowds
are indeed wise, and when they need assistance. We would like to know how best to
aggregate crowd input, to handle transfers of control, and to give remote workers
situational awareness.
    We expect that experimentation with this crowdpowered embodied interaction
approach will help us to elicit context and culture-sensitive rules of interaction that
are key to designing new products; that these techniques will lead to insights that
are more easily codified into machine action than those generated from live,
in-person improvisation; and that end users will respond more positively to ideas
generated from more people than to those developed without the benefit of as much
collaboration.
References
Bell G, Blythe M, Sengers P (2005) Making by making strange: defamiliarization and the design of
   domestic technologies. ACM Trans Comput Hum Interact 12(2):149–173
Bethel C, Murphy R (2010) Emotive non-anthropomorphic robots perceived as more calming,
   friendly, and attentive for victim management. In: AAAI fall symposium: dialog with robots
Clark H (2005) Coordinating with each other in a material world. Discourse Stud 7(4–5):507–525
Dahlbäck N, J€onsson A, Ahrenberg L (1993) Wizard of Oz studies—why and how. Knowl Based
   Syst 6(4):258–266
Dourish P (2012) Designing for zombies. Unpublished manuscript available at http://www.douri.
   sh/publications/2012/zombies.pdf
Dragan A, Lee K, Srinivasa S (2013) Legibility and predictability of robot motion. In: Proceedings
   of the ACM/IEEE international conference on Human-Robot Interaction (HRI 2013), ACM, pp
   301–308
Fischer K, Soto B, Pantofaru C, Takayama L (2014) Initiating interactions in order to get help:
   effects of social framing on people’s responses to robots’ requests for assistance. In:
108                                                                                D. Sirkin et al.
    Proceedings of the IEEE international symposium on robot and human interactive communi-
    cations (RO-MAN 2014), IEEE, pp 999–1005
Fischer K, Yang S, Mok B, Maheshwari R, Sirkin D, Ju W (2015) Initiating interactions and
    negotiating approach: a robotic trash can in the field. In: Proceedings of the AAAI 2015 spring
    symposium on turn-taking and coordination in human-machine interaction, Palo Alto, CA
Fong T, Thorpe C (2001) Vehicle teleoperation interfaces. Auton Robots 11(1):9–18
Garfinkel H (1967) Studies in ethnomethodology. Prentice-Hall, Englewood Cliffs, NJ
Gerber E (2007) Improvisation principles and techniques for design. In: Proceedings of the
    SIGCHI conference on human factors in computing systems (CHI 2007), ACM, pp 1069–1072
Hoffman G, Ju W (2014) Designing robots with movement in mind. J Hum Robot Interact 3
    (1):89–122
Hüttenrauch H, Severinson-Eklundh K (2003) To help or not to help a service robot. In: Pro-
    ceedings of the IEEE international symposium on robot and human interactive communica-
    tions (RO-MAN 2003), IEEE, pp 379–384
Hüttenrauch H, Severinson-Eklundh K, Green A, Topp E (2006) Investigating spatial relationships
    in human-robot interaction. In: Proceedings of the IEEE/RSJ international conference on
    Intelligent Robots and Systems (IROS 2006), IEEE, pp 5052–5059
Johnstone K (1989) Impro: improvisation and the theatre. Methuen, London
Kidd C, Breazeal C (2008) Robots at home: understanding long-term human-robot interaction. In:
    Proceedings of the IEEE/RSJ international conference on Intelligent Robots and Systems
    (IROS 2008), IEEE, pp 3230–3235
Laurel B (2003) Design improvisation: ethnography meets theatre. In: Laurel B (ed) Design
    research: methods and perspectives. MIT Press, Cambridge, MA, pp 49–54
Lee M, Forlizzi J, Rybski P, Crabbe F, Chung W, Finkle J, Glaser K, Kiesler S (2009) The
    snackbot: documenting the design of a robot for long-term human-robot interaction. In: Pro-
    ceedings of the ACM/IEEE international conference on Human-Robot Interaction (HRI 2009),
    ACM, pp 7–14
Leite I, Martinho C, Paiva A (2013) Social robots for long-term interaction: a survey. Int J Soc
    Robot 5(2):291–308
Michalowski M, Sabanovic S, Simmons R (2006) A spatial model of engagement for a social
    robot. In: Proceedings of the international workshop on advanced motion control, IEEE, pp
    762–767
Mok B, Yang S, Sirkin D, Ju W (2014) Empathy: interactions with emotive robotic drawers. In:
    Proceedings of the ACM/IEEE international conference on Human-Robot Interaction (HRI
    2014), ACM
Mok B, Yang S, Sirkin D, Ju W (2015) Every tool in its place: Interaction and collaboration with
    robotic drawers. In: Proceedings of the AAAI 2015 spring symposium on turn-taking and
    coordination in human-machine interaction, Palo Alto, CA
Norman D (1988) The design of everyday things. MIT Press, London
Riek L (2012) Wizard of Oz studies in HRI: a systematic review and new reporting guidelines. J
    Hum Robot Interact 1(1):119–136
Sirkin D, Ju W (2014) Using embodied design improvisation as a design research tool. In:
    Proceedings of the international conference on Human Behavior in Design (HBiD 2014),
    Ascona, Switzerland
Sirkin D, Mok B, Yang S, Ju W (2015) Mechanical ottoman: how robotic furniture offers and
    withdraws support. In: Proceedings of the 10th ACM/IEEE international conference on
    Human-Robot Interaction (HRI 2015), ACM
Takayama L, Dooley D, Ju W (2011) Expressing thought: improving robot readability with
    animation principles. In: Proceedings of the ACM/IEEE international conference on Human-
    Robot Interaction (HRI 2014), ACM, pp 69–76
Weiss A, Bernhaupt R, Tscheligi M, Wollherr D, Kuhnlenz K, Buss M (2008) A methodological
    variation for acceptance evaluation of human-robot interaction in public places. In: Proceed-
    ings of the IEEE international symposium on robot and human interactive communications
    (RO-MAN 2008), IEEE, pp 713–718
                            Part II
Creativity and Creative Confidence
Designing a Creativity Assessment Tool
for the Twenty-First Century: Preliminary
Results and Insights from Developing
a Design-Thinking Based Assessment
of Creative Capacity
1 Introduction
Outside of popular, but dated, psychometric instruments like the Torrance Test of
Creative Thinking (TTCT) (Torrance 1990) and Alternative Uses Test
(Mendelsohn 1976), a widely applicable creativity measurement that takes into
account new information and concepts from recent behavioral- and neuroscience
research has yet to emerge from recent creativity studies. While the popular TTCT
has picture- and verbal-based exercises to assess an individual’s possession of
mental creativity characteristics and divergent thinking abilities, it does not address
the contemporary construct of creativity and innovation that includes associative
abilities and an individual’s ability to overtly apply their creativity to real world
scenarios.
    In order to better assess a person’s creative capacity in real-world situations, we
developed a novel Design Thinking Creativity Test (DTCT) as a next generation
creativity assessment that reflects problem-solving needs of the twenty-first cen-
tury. In particular, the DTCT emphasizes assessment of case-based skills so as to
directly measure an individual’s application of creativity during an innovation
event.
    The Hasso Plattner Institute of Design at Stanford University (Stanford d.school)
offers classes specifically aimed at enhancing creative capacity through design
thinking skill building. As is custom in the academic tradition, instructors must
come up with ways to evaluate their student’s progression. Often, students are
asked to produce deliverables that are then judged by the instructors and classmates.
Although this method has academic value, its purely qualitative nature does not
allow a formal assessment or measurement on whether the student’s creative
capacity has been enhanced. In addition, merely possessing creative capacity is
differentiated from having the ability to activate and deploy creative capacity.
Thus, it is important to examine whether an individual’s creative capacity can be
objectively measured in real-world settings and enhanced.
    Thus, our primary question is: Can an individual’s creative capacity be
objectively measured in real-world settings and enhanced, and if so, what
essential characteristics should be part of the measurement and how does it
align with changes in brain functioning?
    The development of a new creativity assessment prompted other questions that
will be addressed in this chapter. How reliable is a case-base format in capturing
applied creativity in a simulated real-world setting? How do results from test
subjects’ performance on a creativity assessment correlate with functional neuro-
imaging data both pre- and post-creativity training? How do the results from the
DTCT assessment map to the results from the TTCT assessment? Can a case-based
creativity assessment measure the application of creativity skills?
    The creation of a statistically robust, well-standardized test relies on the acqui-
sition of large amounts of data during large-scale studies in which representative
groups of individuals take the test under standardized conditions (Zucker 2003). In
this project, we will align the DTCT with creativity training goals from the design
Designing a Creativity Assessment Tool for the Twenty-First Century:. . .           113
thinking methodology as our initial attempt to establish face validity. By doing so,
we hope to draw connections between creativity training, its effect on an individ-
ual’s creative capacity, and the potential impact of increased capacity from training.
By attaining this goal, we (and others) will be able better evaluate the effects of
training/teaching methodologies by utilizing a direct, twenty-first century assess-
ment instrument.
   We have already laid preliminary groundwork for a next generation creativity
assessment through research efforts from a collaboration between Stanford’s Hasso
Plattner Institute of Design and Stanford School of Medicine’s Center for Interdis-
ciplinary Brain Sciences Research (CIBSR), which were focused on: (a) how
creativity is manifested in the brain; (b) if training can enhance creative capacity;
(c) what are the associated neural correlates of such enhancement; and (d) whether
repeated training is required to maintain the enhancement of creative capacity. This
was performed through a rigorous longitudinal study that included two 5-week long
interventions/trainings sessions and fMRI brain scans, neurocognitive, and behav-
ioral assessments at three time points (Bott et al. 2014; Hawthorne et al. 2013;
Kienitz et al. 2014; Saggar et al. 2014).
For the purpose of this study, our definition of creativity is guided by the philosophy
of the Stanford d.school while insuring translation to the field of cognitive neuro-
science. We define creativity as “a state of being and adaptation of personal skill
sets that enables an individual to synthesize novel connections and express mean-
ingful outcomes” (Hawthorne et al. 2013). This definition captures the intersection
of three different axes. To determine how creative a person, deliverable or process
is, these components can be rated along three continuums from—(a) existing to
new/novel, (b) linear to synthesizing, and (c) no value/meaning to meaningful. We
propose a visual illustration of these continuums with three axes (Fig. 1a). A
deliverable or process with high novelty, meaning, and synthesis is considered
highly creative and so is the person responsible for this deliverable or process. This
person, the deliverable, or the process falls within the upper right and back zone of
114                                                                  G. Hawthorne et al.
the three-dimensional space created by these three axes (the zone in orange in
Fig. 1b).
   This definition of creativity focuses attention to the person and their skills as
opposed to process and outcomes as more traditionally defined. The intention of
this focus is to better align the skill of creativity to indications that go beyond the
possession of creativity into the ability to exercise/apply it.
In the extant literature, convergent thinking is defined as the ability to provide the
single correct solution to a given problem (Runco 2004). This ability to come up
with the right solution, given the constraints of a problem, is thought to require
(a) taking a novel approach to the problem; (b) seeing the problem from a different
perspective; or (c) making a unique association between the given/associated ideas.
The solutions are usually scored in a binary fashion, as either correct or incorrect.
   To test the convergent thinking capacity, researchers usually give participants a
set of insight (or “aha”) problems (Dow and Mayer 2004). The insight problems are
different from the broader class of well-structured problems, such that the solution
to an insight problem specifically requires a mental set shift or reconceptualization
of the problem and given constraints (Goel 2014). Once such mental shift occurs,
the problem solver can access the solution with ease. For example, consider the
famous insight problem—the triangle (Katona 1940). In this problem, the goal is to
arrange six sticks of equal length into four equilateral triangles. Until an individual
considers using the third-dimension of space, he/she cannot solve this problem.
However, once the shift occurs, and he/she realizes the use of three dimensions,
instead of two, the solution presents itself with ease. This phenomenological shift in
solving the problem is also referred to as having an “aha” moment.
Designing a Creativity Assessment Tool for the Twenty-First Century:. . .          115
alternate uses of a given item in fixed amount of time. For example, participants
could be asked to rapidly provide alternate uses for a “pen”.
   In summary, divergent thinking tests are designed to assess a participant’s ability
to rapidly respond and produce as many unusual or original ideas as possible. It is
important to note, however, that the focus in divergent thinking tests is on quantity
and not quality/utility of ideas. Thus, divergent thinking is focused on a specific,
albeit crucial, idea generation aspect of creativity.
In addition to testing the creativity process, via convergent and divergent thinking,
researchers have also focused on assessing the end product for its creative content.
Such assessments are commonly done for artistic products, e.g., paintings, story,
poem etc. Two or more field experts typically perform evaluations of such artistic
products.
   As artistic assessments are limited to professional artists or creative individuals,
the Barron-Welsh Art Scale (BWAS) was developed to capture similar assessments
in nonprofessional artists (Barron and Welsh 1952). The BWAS is a figure prefer-
ence test, which includes simple and complex figures that are liked and/or disliked
by established professional artists. The participant is shown these figures and is
asked to give his/her preference about their liking/dis-liking of the figure. An
individual scores higher on BWAS if his/her liking is inline with the artist’s
preference (Gough et al. 1979).
   Although BWAS scores of creativity have been shown to be associated with
other indicators of creativity (Gough et al. 1979), there is strong controversy
surrounding BWAS’ validity and the specific dimension of creativity assessed
(Ridley 1979).
2.5 Self-Assessments
In addition to assessing creative process and product, researchers have also inves-
tigated different aspects of the creative person him/herself using self-assessments.
Such self-assessments measure a participant’s creative confidence in generating
novel ideas and employing out-of-the-box thinking. Several self-assessment scales
were developed over the last 50 years, including Runco’s ideation and behavior
scale (Runco et al. 2001), the Creative Attitude Survey (Schaefer and Bridges
1970), the Creative Achievement Questionnaire (Carson et al. 2005), and more
recently, the Creative Confidence Scale (Royalty et al. 2014).
   Self-assessment scales provide a unique indicator for “personal” aspects of
creative thinking. However, such scales by themselves are not enough to measure
Designing a Creativity Assessment Tool for the Twenty-First Century:. . .          117
creative capacity and are generally used in conjunction with other measures of
creative thinking.
A case based approach that allows participants to deploy creative skills with
intention opens up the potential of assessing creative capacity in a manner that
goes beyond the domains of divergent or convergent thinking. By including the
creation and building of a real solution from a provided set of cues within a given
set of constraints, participants have the opportunity to create a deliverable with high
novelty, meaning, and synthesis within an assessment framework. But similar to
existing divergent and convergent thinking assessments, in this case based
approach, participants also have the opportunity to both conceptualize numerous
possible solutions and narrow on one particular outcome of their choosing during
the assessment. The case based approach that we have created is a next generation
hybrid that builds on pertinent aspects of previous creativity assessments. Our case
based approach focuses on assessing applied thinking and adaptability of solutions
when faced with new constraints.
118                                                                  G. Hawthorne et al.
The DTCT was developed around Stanford d.school’s design project structure, a
fast-paced approach that runs the participant through a full design cycle. Instead of
pairing up with another person in a typical design project, participants administered
the DTCT run through a course of prompts and activities to create a design solution
that is “useful and meaningful” to a hypothetical user.
   In the course of 30 min, participants employ characteristics of applied thinking
and adaptability for need-finding as well as developing novel solutions. The “case
study” assessment begins with a photographic image that consists of people in an
environment. Three versions of the DTCT were created for up to three different
assessment points; all have the same twelve steps and prompts, and vary in the
opening environment and related condition change (Fig. 2).
and later on rate each prototype, photographs of the prototypes were taken after
completion of the respective prototyping steps. In our prototype, each study partic-
ipant did the DTCT on three different occasions over three measurement periods
during a 12 weeks period. Because the structure of the assessments were similar,
multiple versions could be administered simultaneously.
   A large amount of multi-dimensional data already collected from the aforemen-
tioned creative capacity study will allow us to refine and improve the DTCT by
investigating associations between different domains of the DTCT and other
measures of creativity, cognition, personality and brain function.
5 Data Analysis
The data collected during each DTCT assessment include photographs of the
prototypes developed, as well as in process thoughts and designs scribbled on the
paper booklet. After de-identifying and scanning the booklets and photographs, a
web-based assessment tool was generated for each participant. Using this
120                                                                  G. Hawthorne et al.
web-based tool, the raters (at least two) assess and evaluate the work in an efficient,
reliable and economic fashion. To validate the novel DTCT assessment, we are
currently conducting the following three analyses.
The reliability of the DTCT over time is an important component of establishing the
overall validity of the tool. We will utilize the individual DTCT scores from
participants before and after they receive the Language Gym to perform test–retest
reliability. We hypothesize that 5 weeks of Language Gym should not significantly
affect an individual’s creative capacity. Thus, high correlation values between
DTCT scores before and after Language Gym are expected.
To examine how much value DTCT adds above and beyond the standardized tests
of creativity (e.g., TTCT), discriminative validity analysis (DVA) will be
performed. In this approach, we hope to show that the DTCT provides additional
information about the cognitive skills associated with creativity as well as inter-
vention related outcomes. For DVA, we will use measures of executive functioning,
intelligence, personality, as well brain imaging data.
The standardized administration, scoring procedures and norms, and the develop-
ment and evaluation of the DTCT will begin with already administered tests. Given
the objectives and steps included on the test, norm-referenced measures of creative
capacity and application will be scored by a host of categories derivative of the
TTCT’s widely used scoring workbook (Ball and Torrance 1984). The categories
will be constructed around key design thinking attributes in the assessment. We will
ask the Stanford d.school’s most gifted design thinkers to take and score the test to
establish the highest possible composite score. We will apply the prototype scoring
guide and norm to the already administered tests from the creativity capacity study
(108 tests) and to the wider test population that will take the DTCT for this study to
refine reliability and validity of the assessment.
7 Looking Ahead
Wider testing of the DTCT prototype will be executed to refine assessment format
and improve simplicity and test structure. This will include assessment of the
reproducibility of the instrument by administration of alternate forms of the
DTCT to subjects within a proscribed period of time. In addition, full analysis of
122                                                                        G. Hawthorne et al.
newly administered DTCT test scores to additional subjects will be performed and
correlated with the subject’s TTCT scores before, during, and after a time period
that will include creativity training.
   By creating a psychometrically robust creativity assessment measure as a com-
panion to the TTCT, we will be able to create an assessment tool that can map
creativity training to development and real world practice to impact. These findings
will help guide instructional content and training exercises for creativity training/
intervention in a large number of settings. Increasing creative capacity in individ-
uals goes beyond classroom methodology to applied execution in real world
scenarios for impactful change. The DTCT has the potential to become a new
industry assessment norm for business professionals, educators and executives
across all disciplines and industries as the creativity assessment that matches
problem solving needs of the twenty-first century.
In order to reduce potential confounding covariates from the original prototype and
increase statistical robustness, we will administer the DTCT to a larger sample of
subjects for this study. We will correlate quality and quantity long-term data from
the subjects that participated in the creativity capacity building project, namely
information from both neuroimaging and creativity assessments will be analyzed.
We will administer the DTCT to two-three additional graduate-level courses at
Stanford’s d.school to further refine the test language, to create a test administration
manual, and to strengthen the establishment of a norm group for reliability and
validity. The results from the wider administration of the DTCT will also help us
determine the effectiveness of the case-driven hybrid format as real-world scenario.
References
Ball OE, Torrance EP (1984) Torrance tests of creative thinking streamlined scoring workbook,
   figural A. Scholastic Testing Service, Bensenville, IL
Barron F, Welsh GS (1952) Artistic perception as a possible factor in personality style: its
   measurement by a figure preference test. J Psychol 33(2):199–203. doi:10.1080/00223980.
   1952.9712830
Bott N, Quintin E-M, Saggar M, Kienitz E, Royalty A, Hong DW-C et al (2014) Creativity training
   enhances goal-directed attention and information processing. Think Skills Creat 13:120–128.
   doi:10.1016/j.tsc.2014.03.005
Carson SH, Peterson JB, Higgins DM (2005) Reliability, validity, and factor structure of the
   creative achievement questionnaire. Creat Res J 17(1):37–50. doi:10.1207/s15326934crj1701_4
Dietrich A, Kanso R (2010) A review of EEG, ERP, and neuroimaging studies of creativity and
   insight. Psychol Bull 136(5):822–848. doi:10.1037/a0019749
Dow GT, Mayer RE (2004) Teaching students to solve insight problems: evidence for domain
   specificity in creativity training. Creat Res J 16(4):389–398. doi:10.1080/10400410409534550
Designing a Creativity Assessment Tool for the Twenty-First Century:. . .                      123
Goel V (2014) Creative brains: designing in the real world. Front Hum Neurosci 8:241. doi:10.
   3389/fnhum.2014.00241
Gough HG (1979) A creative personality scale for the adjective check list. J Pers Soc Psychol
   37:1398–1405
Guilford JP (1950) Creativity. Am Psychol 5(9):444–454
Hawthorne G, Quintin E-M, Saggar M, Bott N, Keinitz E, Liu N (2013) Impact and sustainability
   of creative capacity building: the cognitive, behavioral, and neural correlates of increasing
   creative capacity. Springer, Cham, pp 65–77. doi:10.1007/978-3-319-01303-9_5
Katona G (1940) Organizing and memorizing: Studies in the psychology of learning and teaching.
   Morningside Heights, Columbia University Press, New York
Kienitz E, Quintin E-M, Saggar M, Bott NT, Royalty A, Hong DW-C et al (2014) Targeted
   intervention to increase creative capacity and performance: a randomized controlled pilot
   study. Think Skills Creat 13:57–66
Mednick S (1962) The associative basis of the creative process. Psychol Rev 69(3):220. doi:10.
   1037/h0048850
Mendelsohn GA (1976) Associative and attentional processes in creative performance1. J Pers 44
   (2):341–369. doi:10.1111/j.1467-6494.1976.tb00127.x
Ridley DR (1979) Barron–Welsh scores and creativity: a second look. Percept Mot Skills
   49(3):756–8
Royalty A, Lindsay NO, Bernard R (2014) Acting with creative confidence: developing a creative
   agency assessment tool. In: Plattner H, Meinel C, Leifer L (eds) Design thinking research.
   Springer International Publishing, Heidelberg, pp 79–96
Runco MA (2004) Creativity. Annu Rev Psychol 55:657–687. doi:10.1146/annurev.psych.55.
   090902.141502
Runco MA, Plucker JA, Lim W (2001) Development and psychometric integrity of a measure of
   ideational behavior. Creat Res J 13(3–4):393–400. doi:10.1207/S15326934CRJ1334_16
Saggar M, Hawthorne G, Quintin E-M, Kienitz E, Bott NT, Hong D et al (2014) Developing novel
   methods to assess long-term sustainability of creative capacity building and applied creativity.
   In: Plattner H, Meinel C, Leifer L (eds) Design thinking research. Springer International
   Publishing, Cham, pp 29–39, doi:10.1007/978-3-319-06823-7_3
Schaefer CE, Bridges CI (1970) Development of a creativity attitude survey for children. Percept
   Mot Skills 31(3):861–862. doi:10.2466/pms.1970.31.3.861
Torrance EP (1990) Torrance tests of creative thinking. Figural forms A and B. Scholastic Testing
   Service, Bensenville, IL
Wallach MA, Kogan N (1965) Modes of thinking in young children. Holt, Rinehart & Winston,
   New York
Zucker S (2003) Fundamentals of standardized testing. Pearson Inc, New York
Innovation in Creative Environments:
Understanding and Measuring the Influence
of Spatial Effects on Design Thinking-Teams
2009; Dunne and Martin 2006; Dym et al. 2006). Overall it represents a mind-set
including a repertoire of tools, methods and techniques that can support a profound
shift in organizational problem solving (Burry 2005; Dunne and Martin 2006).
Organizations have started to adopt some of the specific problem solving practices,
methods and tools, that have been practiced by designers in dealing with framing
and re-framing complex design challenges.
   Recent publications show the growing consideration of designing creative office
environments (e.g. Groves and Knight 2010). The increasing occupation with
spatial factors in the daily routine of individual office workers and employees is
largely supported from two sides: The technical side enables increasing work
digitalization and mobility. On the scientific side there are studies showing that
productivity increases of up to 50 % can be achieved for office workers if spatial
factors are considered (Walden 2011). Research results in neurology cast new light
on the way we learn and work. Since industrialization, we have become used to
environments for learning and working that do not suit the capacities we have
developed over the course of evolution. John Medina goes so far as to say: “Anyone
wanting to create a professional environment that diametrically opposes the special
capacities of the brain will probably end up with a small office” (Medina 2014,
p. XV).
   However, there is a need to establish effective workspaces in order to enable
appropriate spatial interactions as well as to create physical environments for
diverse innovation activities (Moultrie et al. 2007, p. 53). Although there seems
to be a consensus that the office environment has an effect on productivity of the
individual employee, guidelines and methods for setting up new spaces, which suit
in particular the specific needs of teamwork, its culture and objectives, are missing.
According to Thoring and Müller (2011) a flexible working environment enables
and supports the ability of teams to innovate, create and design. Peters (1992,
p. 413) depicts space management as probably one of the most ignored and
simultaneously the most powerful tool for the implementation of cultural change
within organizations to foster innovation and learning within organizations. Work
environments for creativity are often designed rather intuitively or based on often-
cited examples of best practices (e.g. IDEO in the 2000s, Google nowadays). For
example Magadley and Birdi (2009, p. 315) discuss the advantages of innovation
labs as “dedicated physical environments or facilities with collaborative work
spaces in which groups and teams of employees can engage with each other in
order to explore and extend their thinking beyond and above normal boundaries”.
They suggest that spaces designed for fostering creativity have a different physical
layout, a range of high- and low-tech supporting tools, and expert facilitators (see
Moultrie et al. 2007) and show that such spaces have a positive impact on creativity
and on participants’ attitudes towards it. However, “the research studies on the
spatial support of creative thinking with reference to the arrangement of a
creativity-promoting work environment have not yet resulted in any generally
valid, integrated complete solution” (cf. Rieck 2011, p. 43). Only insufficient and
rather fragmented academic research has been conducted on the attributes of
effective physical environments supporting creativity and innovation of teams.
Innovation in Creative Environments: Understanding and Measuring the. . .          127
Yet there is limited scientific proof of the benefits or the wider effects of the work
environment on team performance and team wellbeing.
Bearing in mind the discussion about the use of cultural probes, we applied the
method accordingly and designed the probes primarily to provoke inspirational
responses from members of innovation teams concerning their perception of space.
The cultural probes are used to identify beneficial and hindering spaces and spatial
attributes within the context of the innovation space. We decided to additionally
conduct qualitative interviews with the participants after the return of the cultural
probes. This helped us to better understand their responses and offered an oppor-
tunity for further explanations. Through this we were able to conduct a qualitative
content analysis of the collected data that led to an insightful outcome that will
provide the basis for further experimental research to be conducted in the future.
    The cultural probes tool-kit was designed to investigate the influences that the
spatial environment might have on creativity in innovation processes and wellbeing
in design thinking-teams. It contained three different tasks that helped us to
understand the perception and the influence that spatial factors might have during
a complete innovation project. Figure 1 shows the design of the cultural probes tool-
kit highlighting its rather playful nature.
    The cultural probes entailed three parts that were to be completed over a period
of 3 weeks. To suggest an order of completion the tasks were put in three separate
envelopes and numbered. Yet the participants were free to work on the tasks at any
time during their innovation project. Each part was estimated to take at least 45 min.
In addition to the data collected through this method, short qualitative interviews
were conducted with the participants when the boxes were returned to the
research term.
    The tasks were designed to measure different aspects. Task 1 aimed at
documenting the use of different spatial settings throughout the whole innovation
process of the team. Additionally team members were asked to indicate their
individual and the team’s emotional state when working in different locations.
Task 2 focused on the design of specific spaces and its impact on fostering specific
working modes in terms of team wellbeing: teamwork (creation, ideation and
prototyping), collaboration (knowledge, learning, and reflection), presentation
(share and display project outcomes), as well as communication (socializing and
relaxation). Task 3 was rather visionary and asked the participants to create their
personal dream space.
    To motivate the participants over the 3 weeks of time, incentives, such as a
chocolate bar, were added. Additionally practical items (i.e. pens, glue, markers,
scissors etc.) were added. All these tools were of high quality. As further incentive
for participating the students were free to keep all material that was not used to
complete the tasks. The cultural probes were wrapped in a cardboard box with the
size of a DIN A 4 folder, to be carried home and brought back easily.
    We selected different student teams of HPI School of Design Thinking at
Potsdam as the study group. The sample size consisted of three different innovation
teams working on a design project over a period of 8 weeks. The sample size
consisted of 12 participants in total. This group was chosen due to its diverse nature,
which results from the multidisciplinary background of design thinking teams. Due
to the fact that the students together formed three teams, it was possible to compare
their individual perception with the perception of other team members as well. All
participants were moderately experienced in design thinking and went through a
6 weeks-training course before starting their innovation project.
Previous work by Gensler (2008, 2013), Groves and Knight (2010), and Doorley
and Witthoft (2012) suggests that performance and wellbeing of innovation teams
could be improved when different spatial settings are used for distinctive tasks.
    The HPI D-School offers a variety of spaces that either promotes teamwork,
meeting spaces, lounge spaces, workshop spaces, reading and silent spaces as well
as spaces for individual work. In this research setting each team was assigned to
their own team space in an open studio setting shared with eight teams in total. Each
team space is typically equipped with moveable whiteboards, one flexible table,
high chairs and material for teamwork. Spaces commonly used by all teams are the
presentation space equipped with various projectors and flexible seating (including
sitting cubes, chairs and sofas). The lounge as well as the kitchen is the assigned
communal space (including various sofas, bean bags, and sitting cubes) whereas the
library is used as a reading room and a room for silent work (for individuals). The
130                                                                          C. Nicolai et al.
workshop spaces (one for digital prototyping and one for handicraft work) can be
found on different floors and offer facilities for teams as well. If weather conditions
are permitting it, teams can also use the patio in front of the building and a
teamwork pavilion (designed to host a maximum of two teams). Because different
spaces can foster different work modes the teams have been encouraged to change
their workspace according to their innovation project and team wellbeing needs.
   The analysis of the usage patterns of the three teams under scrutiny revealed that
the design thinking-teams have used all different spaces during their project.
Figure 2 shows an example of a team with dots indicating the usage and emotional
states associated with different spatial settings during the whole innovation process.
All three teams displayed the same usage patterns in terms of time spent at the
different locations. It is interesting to notice that the teams quite often moved to a
different space when they felt stuck in the innovation process as one interviewee
underlined: “We definitely had great ideas here [in the team space] but most of the
breakthrough moments happened when we changed to another space.” And another
person said: “What I did with my team was to push them out of the space where we
were currently working when we got stuck. .... When we entered a new space,
outside or inside, we felt unblocked by the surrounding itself.”
   These findings indicate that team performance can be fostered by the change of
the team’s workspace location. Overall, we discovered that the teams did not only
change the location of their teamwork based on the stage of the innovation process
stage they were currently working on; e.g. moving to the workshop space to build
prototypes, moving to the communal space to relax, or finding a quiet space in the
library to reflect—individually—on their team performance. They quite often
changed their spatial setting as a mean to ignite their creative potential and
therefore team performance. Quite often the teams went to seek a space that
seemingly offered them the opposite wellbeing situation to the current state.
When the assigned team space was considered as too noisy and too distracting they
moved outdoors or to the library. Thus, a lot of “misuses” of additional spaces took
place over the time of their innovation project, e.g. using the silent space of the
library as the place to perform an intense—and loud—round of ideation, working in
the workshop space (in the basement) to find a—silent—space without any noise
and other teams around them for individual “thought-work” without touching any
of the tools.
    In addition to the use of the various spaces for the distinct tasks of an innovation
process we also asked all team members to share with us their assessment of their
individual wellbeing and their perceived wellbeing of the whole team in different
locations. Our interviews after the completion of the cultural probes revealed that
the team members were able to indicate very accurately their own emotions about
how they felt individually about working with their team in the different locations
but were not able to express their team’s overall feelings with the same accuracy. It
is also worth noticing that we have got completely different answers from the
members of the same team about their individual feelings about working in a
specific space, in particular for their assigned team space and the outdoor spaces
(either on the patio or the lawn in front of the building). For example, one team
member stated that he felt energized “. . . when we went outside and walked around
barefoot on the ground” whereas another member of the same team stated that he
did not think it helped the team’s progress because “. . . we lacked the tools to take
all relevant information with us to our temporary workspace outside.” Therefore it
is no wonder that the assigned team space—inside—and the most often chosen
change to a temporarily team space—outside—were the two most often marked
spaces in our cultural probes set that received positive as well as negative conno-
tations as working spaces. To quote two other interviewees: “In our assigned team
space we felt imprisoned. . . . Trying to create ideas was not much fun here but
everywhere else.” Another person stated: “This was OUR team space with a lot of
emotions going on . . . You should really explore what YOUR TEAM SPACE can
do for you as a team.”
    When analyzing all individual emotional states associated with the use of
specific spaces we found an interesting relationship with the frequency of usage.
The spaces that were used most often are in the order of frequency: the assigned
team space, outside spaces, presentation space, workshop spaces, lounge, and
library. In particular the first two ones received positive as well as negative
emotions and feelings: “What I personally found interesting is that our own team
space it the place where we felt most confused.” And another test person stated: “In
the outside pavilion we had great moments but it also sometimes did not work out at
all.” The three latter ones received positive emotions as well as where spaces
individual team members were indicating that the teamwork had breakthrough
moments with regard to the team’s performance.
    Overall, positive emotions are most often experienced at places to socialize or to
recreate (e.g. lounge and kitchen). Spaces to create and make (e.g. workshop
spaces) as well as sharing spaces (e.g. presentation spaces) are also perceived as
uplifting. Whereas at those spaces where most of the intense teamwork
132                                                                   C. Nicolai et al.
(concentration and convergent thinking) took place (e.g. team spaces and outdoor
spaces) are often related with mixed emotions.
 Cultural probes task kit to visualize the         Examples of illustrated postcards done
 individually preferred spatial environment        by the participants of the study
 for different working modes
Fig. 3 Example of the cultural probes task on spatial settings for different working modes
category. Furthermore, the participants were asked to record the surroundings and
to take pictures of analogous places they experienced in real life and which they see
as objects of reference for the specific working modes.
    It is interesting to notice that in particular for the “build and make”-space the
range of individual preferences is very broad. The individually mentioned prefer-
ence for the surrounding sound can exemplify this: from silence to soft music
without lyrics to loud electronic beats nearly everything was mentioned. This
already indicates one of our key findings: individual preferences are not always
compatible with each other. Nonetheless the content analysis led us identify two
different patterns: The participants were distinguishing ‘spaces for inspiration’
from ‘space for doing’. Inspirational spaces offer a lot of visual clues, different
kinds of material at display, and different seating and working positions/working
stations. This describes an informal atmosphere with enough space to walk around
and to change positions. Some of the participants even referred to being in nature as
their ideal inspirational space. Spaces for ‘doing’ are associated with a clean and
tidy workshop space with access to all tools needed (for building as well as digital
prototyping).
    The associations and feelings about the “present and show”-space showed some
differences as well. Overall the participants clearly distinguished between spaces
and situations in which merely presenting—and not sharing—takes place
(an atrium, a stage) from situation where both take place. For the first one, different
participants distinguished between real and virtual presentations. For the latter they
wish for a comfortable and relaxing situation for the audience and an opportunity to
interact with them. Thus, rooms should not be distracting too much.
    The space to “communicate and reflect” is obviously a space for the whole team
and/or for the individual alone. The participants depicted different situations in
which they were alone, in smaller groups of two to three people or in their whole
team. It is interesting to notice that preferred spatial settings never included a larger
134                                                                    C. Nicolai et al.
group or audience. The ideal locations to practice this working mode range from the
assigned teamwork space to a walk in the park (alone or with another team
member). The participants mentioned that they need tools for this, like pens,
post-it’s or a notebook, screens for visualization, surfaces to write on, etc.
   Spaces to “relax and socialize” were also associated with a broad variety of
settings, activities and people. Besides the fact that all participants mentioned
practicing sports, being with friends, eating and drinking, and spending time in
nature the prominent overarching topic has been a surrounding trustful atmosphere.
Thus, either the space itself or the time dedicated to it should provide the partici-
pants with the feeling that they are—more than—allowed, invited, to do so: relax
and socialize.
   Both activities, designing the postcards and doing the visual and audio record-
ing, together composed a colorful picture that provided valuable insights about the
individual preferences for the different additional spatial categories. In the post-
cards activity, the students were free to merge experiences of multiple spaces to
create a catalogue of characteristics for each spatial category. However, this
included a further layer of interpretation because individual preferences displayed
a large variety of possible spatial solutions.
The third part of the cultural probes set was merely visionary. Assuming that the
tasks were completed in the assigned order we expected the participants to have
gained awareness about their spatial surroundings and the topic of spaces for
creativity in general. This created the mindset to design the creative space of
their dreams. To fulfill this task, the participants were provided with material and
tools. We decided to provide 36 samples of material for interior design including
printings of surrounding elements such as clouds or water. The haptic experience of
different materials supposedly helped the students to better imagine the use.
Additionally the students were asked to add people-posture-cards that we provided.
These additional 36 cards showed different sizes of groups and singular people in
different working positions (e.g. standing, sitting, etc.). This gave us a hint of the
activities assigned to the space and encouraged the students to keep these in mind
when designing the space. Purposely no floor plans or pre-designed forms were
provided. The spaces were to be created on a blank paper. On return of the boxes,
we realized that a few even used the wrapping, hence the cardboard box and created
a 3-D model of the space (see Fig. 4).
   On average the participants have used 11 of the 36 provided material samples
and 8 of the 36 different working positions. Figure 5 shows the most often used
material samples.
Innovation in Creative Environments: Understanding and Measuring the. . .                     135
 Cultural probes task kit with material             Example of a creative space for innovation
 samples                                            designed by one participant
Fig. 4 Example of the cultural probes task on designing creative spaces for individuals and teams
18 15 11 11 10 10
9 9 9 7 7
7 6 5 5 5 5
   When analyzing the aesthetics of the used materials we were able to identify two
different material combinations: On the one hand we found the combination of
rather more natural materials (plank floor, cork, grass carpet) for floors with a
translucent atmosphere (frosted glass, wallpaper with the depiction of skies and
water). On the other hand the combination of natural materials (plank and concrete
floor) with a more industrialized, but bright atmosphere (different textured mate-
rials in white, grey and red, brick and white wall).
   The individual designs of ideal creative spaces consisted typically of three
different working zones. All of the 12 designs included a designated zone for
team spaces. The two other zones are a recreation space (designed 9 times),
presentation space (6 times), kitchen as lounge (5 times), silent space (5 times),
136                                                                     C. Nicolai et al.
Previous research in the field of creative environments has been focusing on the
description of different physical characteristics of alternative environments for
innovation teams. However, if doing so without considering the usage of the
space by innovation teams and its impact on team wellbeing and performance
will provide little insights into the role and effectiveness of place and place-making
(Moultrie et al. 2007). The authors have also experienced many different examples
of innovation spaces during their analysis of best practices, e.g. co-working space,
innovation greenhouses and idea saunas.
   Using the method of cultural probes we were able to define a research method
that helps in particular to reveal the impact of creative environments on team and
individual wellbeing over time. Overall, the method of cultural probes has been
used to identify beneficial and hindering spaces and spatial attributes within the
context of the creative space during an innovation project of teams, to discover
different spatial requirements for diverse working modes outside the individual
teamwork environment as well as to collect the designs of the ideal, imagined
innovation spaces of the participants. It needs to be noticed that this has been a
rather descriptive qualitative study and that the impact of creative environments on
team performance is still a research question under scrutiny.
   However, we were able to identify some key findings: Overall, different patterns
could be detected concerning the emotions of innovation team members in different
team working spaces—at different times. As the participating innovation teams
were free to move around the HPI D-School building and use the spaces for
different purposes besides only their assigned team-space, it was difficult to clearly
define the impact of a space. For example the library was used for an ideation
session, hence a creative act, while others used it to concentrate and reflect. Yet, it
can be derived that switching spaces might trigger the switch from being a “stuck
team” to a team seeking a new start for their flow of creativity with positive
emotions and through this promotes creativity. The collected data from the second
Innovation in Creative Environments: Understanding and Measuring the. . .          137
task of the cultural probes set proved that preferences concerning the set-ups for
different working modes in the design thinking process are very subjective as they
differed greatly. For some, listening to loud and fast music seems to be a means to
recreate, while others prefer a silent surrounding. For some participants open and
crowded spaces are fostering creativity while others like clean and separated
spaces. The third task confirmed the aspect of subjective preferences. Innovation
teams need different working zones; not only one assigned team space. All partic-
ipants completed the task to imagine the creative space of their dream by designing
a compilation of several spaces that were assigned to different tasks or functions. In
particular zones for sharing, connecting and learning seem to be important for team
wellbeing. A reduced design combining rather natural materials with a bright
atmosphere and a touch of industrial aesthetics with steel, bricks and a lot of
white in general seem to be a common wish for creative spaces.
    It would be interesting to further investigate if the above-mentioned preferences
for outdoor (innovation and recreation) activities as breakout are a phenomenon due
to the fact that the study has been conducted over the summer in Germany. We
suppose that this depicts another important aspect to consider in the configuration of
innovation spaces: the design and usage of team workspaces and individual work-
places as well as the ability to change between different, physically active, loud,
and visual vibrating team-oriented working phases, and quiet, thoughtful and
reflective individual-oriented working phases. The expertise of one author being
involved in the design and setup of a D-School in Malaysia underlines this conclu-
sion. Local weather conditions (high temperatures, humidity, and rain) prevent
working outside and breakout-spaces for individuals had to be—and could be—
created indoor as well.
    Bearing in mind all the above, it can be concluded, that subjectivity in aesthetics
plays an important role once it comes to the design of innovation spaces. Nonethe-
less, what all participants have in common is the need for different creative spaces
and working zones fitting the different tasks and activities throughout an innovative
team process.
    (Mental) freedom is needed to adjust the workspace according to, on the one
hand, the tasks to be done at a certain point in the process, on the other hand, to set
up the space to meet individual (team’s) preferences. This accounts to the model of
Freiraum, that Auernhammer and Hall (2013) defined to be promoting creativity.
Furthermore, it seems that creativity was fostered when teams were encouraged to
select a specific space to work at and then transformed it to make it their own. The
flow of emotions went up when teams moved to another work zone and had to
transform it—temporarily—to make it their own.
    Overall the empirical study, as described above, showed that having the freedom
to choose is a promoting factor for creativity. Yet, creating a space that is a blank
page, does not seem to help. A space that fosters creativity rather suggests a
direction of usage but also leaves room to change it.
    This conclusion opens up two fields for future research: The first topic to be
addressed is the academic rigor given to the relationship between creative spatial
settings and the perception and evaluation of team wellbeing and performance. In a
138                                                                             C. Nicolai et al.
follow-up study we would like to analyze team processes in-situ to add insights to
our retrograde approach with cultural probes. The second topic encompasses the
innovation team process itself. We would like to better understand how to enable
innovation teams to make the choices that best possibly promote their creativity.
This action-based research approach will shed further light on creating and using
the innovation space due to the team’s changing tasks and activities throughout the
process.
References
Starkey K, Tempest S (2009) The winter of our discontent: the design challenge for business
   schools. Acad Manage Learn Educ 8(4):576–586
Thoring K, Müller R (2011) Understanding the creative mechanisms of design thinking: an
   evolutionary approach. In: Proceedings of the second conference on creativity and innovation
   in design, pp 137–147
Walden R (2011) Auch mal im Garten arbeiten. In: Deutsches Architektenblatt (08/2011),
   pp 14–17
Zollenkop M (2006) Geschäftsmodellinnovationen. Deutscher Universitäts, Wiesbaden
Building Blocks of the Maker Movement:
Modularity Enhances Creative Confidence
During Prototyping
Fig. 1 Plug-and-play
interaction with electronics
modules
Can we enable anyone to create anything? Maker culture encourages those with ideas
to empower themselves with the creative tools to make their ideas tangible. In the
world of electronic devices, the rising availability of novice microcontroller platforms,
desktop physical fabrication tools, affordable electronic components, and a global
online community, has fuelled more designers to hack, mash and glue their way to
working electronic prototypes (Hartmann et al. 2008). Hundreds of DIY fabrication
facilities, Fablabs, have emerged over the years to provide public spaces for digital-
physical prototyping, ranging from laser cutters, and computer-controlled milling
machines, and 3D printers (Anderson 2014). Tools that were once only accessible to
large companies, or required professional knowledge, are now within reach of ordinary
citizens. In the electrical domain, novices can now chose from a growing array of
premade electronics modules, such as sensors and actuators (Fig. 1), that hide much of
the technical detail into plug-and-play LEGO-like building blocks (Sadler et al. 2015a,
b). These emerging tools have strong implications for Design Thinkers who wish to
transform low-resolution prototypes from paper and duct-tape, into functional inter-
active devices. But what are the core building blocks that might make up electronics
toolkits of the future, and what are their key affordances? Here we propose the idea that
modularity, the ability to freely recombine elements, is a critical affordance for novice
prototyping with electronics. What are the creative trade-offs with more modular
electronics blocks? Do these blocks help or hinder the creative design process?
When faced with a creative challenge, designers are encouraged to create many
low-resolution solutions in a limited time period. These prototypes tend to highlight
the essence of an idea, rather than the technical details of implementation. However
prototyping with software and electronics can slow down the creative process by
(i) requiring prior technical knowledge and (ii) introducing potential for errors in
implementation. By using pre-defined modules, a designer may add blocks that
encapsulate a desired functionality, while hiding the technical details. Here we
define modules as encapsulated blocks of functionality that can be added to or
removed from a system independently (Baldwin and Clark 2000).
    A component with a high degree of modularity has fewer dependencies on
outside variables. In prototyping, this implies that modules enable designers to
Building Blocks of the Maker Movement: Modularity Enhances Creative. . .              143
freely try combinations of parts, much like adding bricks in a toy construction kit.
For example if the designer decides to add a green LED light to a prototype, a plug-
and-play interaction is a more modular interaction than building a light circuit from
basic components. We can think of the light module as a “black box” that does not
reveal its inner details, but rather emphasizes the core functionality. For designers
who are trying to create novel solutions, the idea of “black box” that cannot be
modified on the inside seems counter to the idea of creative freedom. On the other
hand, the “black box” approach allows the designer to focus on the high-level
design decisions such as, “Should we include a light?” before diving deeper into
technical requirements such as, “What value of resistance should we include to
limit the current?” Prototyping with black-box modules therefore has tradeoffs
between flexibility and ease of use. The goal of this research is to better understand
the effects of these modular tradeoffs on the designer’s creative ability.
2 Background
What does high or low modularity look like with electronic components? Consider
the following example of two multi-color light emitting diode (LED) components
shown in Fig. 2. In the example of “low modularity,” a low-cost tri-color LED can
be added to a programmable microcontroller by connecting four individual pins,
and choosing 3 appropriately sized resistors. In this case, the LED does not contain
all of the components required to function reliably, and a designer must choose the
correct resistors from thousands of possible resistance values.
    Making an incorrect connection, or choosing a resistor that is too small or too large,
results in a non-functional light. In the “high modularity” example, a self-contained
LED module contains the necessary components to function properly and a single
connection to the outside world. In this case, an expert has made the resistor choices in
advance, and the module has a simplified single connection interface. This module
reduces the outside variables that may lead to errors, such as choosing bad resistors or
plugging in the component in an opposite polarity. These contrasting examples
illustrate that a key difference in prototyping with modules is that they tend to reduce
the variability, difficulty, and number of steps required to combine components, in
exchange for reduced flexibility and increased cost. These trade-offs are summarized
in Table 1. Modules may be a poor choice if one’s goal is to educate a designer on the
technical aspects of a system, since the details are hidden. However, if the primary
goal is to enable functional prototyping of a creative idea as quickly as possible, then
modules are effective candidates. Modules are therefore the “chunks” that enable a
designer to incorporate a whole unit of functionality; in much the same way that
cognitive chunking (Miller 1956) enables efficient clustering of complex information.
Here the authors use the concept of a “chunk” as equivalent to a module, as proposed
by the design decomposition work of Pimmler and Eppinger (1994).
During prototyping we assume that an ideal creative toolkit will enable a designer
to confidently create many prototypes in a given time frame, allow diverse ideas to
be expressed, and place a designer in “the zone” of continuous cognitive flow. Since
modularity tends to reduce the difficultly and number of steps required to create a
prototype, we hypothesize that increasing the modularity of prototyping tools will
have positive effects on:
1. Prototype Quantity: The number of distinct prototypes created in a given time
   period.
2. Designer’s Creative Feeling: The degree to which designers feel they are in a
   creative state.
3. Self-Efficacy: The confidence that a designer has in his or her ability to create
   prototypes. (Csikszentmihalyi 1992).
4. Cognitive Flow: The degree to which a designer feels that a task’s difficulty
   level matches one’s perceived abilities (Csikszentmihalyi 1992).
Building Blocks of the Maker Movement: Modularity Enhances Creative. . .             145
   The above metrics have interconnected effects on one another. For instance,
prior research has shown that designing several prototypes in parallel can contribute
to an increase in both the quantity and creativity of prototypes produced (Dow
et al. 2010). This research focuses on the above metrics as general indicators of a
desirable prototyping experience.
Creating prototypes with technology can be difficult for a technical novice. A designer
who lacks a technical background in electronics and sensors, for example, may be less
confident in his or her ability to make working prototypes that include electronic
components. Bandura (1977) refers to the measure of a person’s confidence in his or
her ability to achieve a task as self-efficacy. In this research we consider a specific
aspect of self-efficacy, referred to as creative self-efficacy (Tierney and Farmer 2002).
For a prototyping task, we aim to understand how the qualities of a tool will encourage
designers to feel confident in their ability to create. Prior work on the use of novice
electronic prototyping toolkits found that using modular building blocks can be an
effective way to encourage more fluid prototyping (Hartmann et al. 2005). The core
premise of Hartmann’s work is that modularity hides some of the technical details and
reduces the perceived and actual difficulty to play with an unknown component.
However, to the authors’ knowledge there is limited prior data on the cognitive effects
of modularity during creative prototyping.
3 Related Work
In order to examine the effects of tool modularity on the creative design process, we
need a repeatable and modifiable task to serve as a basis for measuring prototyping
performance. The methods we chose were modeled on two prior studies, one
examining conformity in creative generation tasks (Marsh et al. 1996), and another
by Kulkarni et al. (2014) that uses a crowd-sourced method to explore a creative
prototyping task.
Marsh et al. (1996) illustrated that presenting design examples to participants prior
to a generative design task influences one’s creative output. Participants were
shown pre-made examples of alien creatures and then asked to draw as many
unique creatures as possible in a given period of time. The study found that showing
design examples in advance increased the degree of conformance among design
ideas. We consider this study as a strong model of a creative prototyping task, with
a controlled manipulation. Specifically, we see a parallel between electronic mod-
ules and the pre-made design examples in the study by Marsh et al., since modules
can be considered as pre-made units of function that a designer is provided in
advance and may choose to include in a prototype.
In building on the prior study, Kulkarni et al. (2014) examined how creative output
is affected by the timing when design examples are shown to study participants.
Their research found that early and repeated exposure to examples increased
Building Blocks of the Maker Movement: Modularity Enhances Creative. . .                        147
4 Methods
Our research aims to understand how modularity affects creative prototyping from
both quantitative and qualitative perspectives. For a controlled design task, we
hypothesize that:
H1 Increasing the modularity of a prototyping tool enables designers to generate a
higher quantity of prototypes in a given period of time.
H2 Increasing the modularity of a prototyping tool increases a designer’s creative
feeling, perceived self-efficacy, and cognitive flow. [Conversely, increasing a
designer’s exposure to technical details while prototyping (e.g., through the number
of wiring options) decreases these cognitive measures.]
   To test these hypotheses, we provided a generative prototyping challenge to a
broad group of participants, and varied the degree of tool modularity for each
participant. Based on a modified version of the prototyping task by Kulkarni
et al., we asked participants to create as many unique and novel electric alien
creatures as possible in a 10-min period (Fig. 4). We provided the following
prompt, adapted from Marsh et al.:
   Imagine a planet like earth existing somewhere else in another universe. It is currently
   uninhabited. Your task is to design new creatures to inhabit the planet. The creatures in this
   world are very special, since they all are made up of tiny electric blocks. With 10 minutes
   allotted, draw and describe as many new and different creatures of your own creative design
   as you are able. Duplications of creatures now extinct or living on the planet Earth are not
   permitted.
Fig. 5 The alien circuit interface where participants could drag LEDs onto a 16  16 grid
Building Blocks of the Maker Movement: Modularity Enhances Creative. . .                149
asked to physically arrange and describe each circuit board design, where each
circuit would be a visual representation of a novel creature. The LED components
were presented as a possible design option to add “virtual” colored lights. Using this
software representation of a circuit allowed us to manipulate features of the
interface for three user groups.
Fig. 6 Three groups. Groups A and B were given a technical description of LEDs. Participants
had to select the correct wire orientation for the placement of each LED
150                                                                      J. Sadler et al.
4.2 Procedure
Each participant engaged in four steps: (i) pre-survey, (ii) description of task and
creature examples, (iii) 10 min task, and (iv) a post-survey. Before and after the
task, participants were asked to rate their level of confidence in prototyping with
electronics, and their current feeling of creativity on a 1–7 point Likert-type scale,
with 7 being the highest. At the end of the study, cognitive flow was measured using
the flow short scale survey developed and validated by previous studies on flow
(Engeser and Rheinberg 2008). We logged usage statistics, such as the number of
creatures created and the number of LEDs used per participant. For each group we
computed the (Y1) average quantity of creatures created, (Y2) average number of
LEDs used, (Y3) average change in creative feeling between pre-task and post–task
surveys, and (Y4) average change in perceived self-efficacy.
5 Results
All participants (N ¼ 86) attempted to create creatures, and the virtual circuit
building exercise appeared to be an effective design task for generating diverse
prototype ideas (Fig. 7). Qualitatively we found that the 1616 grid was sufficient
to express creative ideas and most participants provided colorful verbal descrip-
tions. Statistically comparing the control group with each group (two-tailed t-test),
we found the following results (summarized graphically in Fig. 8):
1. Prototype Quantity: Participants in Group B made significantly fewer creatures
   than the Control. (2.42 vs. 3.47, p < 0.05). Both Group A and B used fewer LEDs
   than the control group (A ¼ 16.6 B ¼ 12.62 vs. Control ¼ 27.6, p < 0.05). There
   was no statistical difference between the prototype quantity of Group A and the
   Control (3.43 vs. 3.47, p > 0.05).
2. Self-efficacy and Creative Feeling: Both the Control and Group A reported an
   increase in creative feeling and self-efficacy after the design task (+0.57 and
   +0.47, and +0.5 and +0.44, respectively). Group B reported a decrease in
   creative feeling and self-efficacy (0.58, 0.37, on the 7-point Likert scale),
   and these changes were significantly different than the Control (p < 0.05).
3. Cognitive Flow: Group A had an increased flow score compared with the
   Control (4.99 vs. 4.56, p < 0.05). There was no statistical difference between
   the flow scores of Group B and the Control.
Building Blocks of the Maker Movement: Modularity Enhances Creative. . .                        151
Fig. 8 Summary of prototype quantity, self-efficacy change, creative feeling and flow across
groups
6 Discussion
The results of this study show that we can measurably increase a designer’s
perceived self-efficacy and creative feeling through the use of modular prototyping
tools. In contrast, exposing designers to a higher degree of technical detail while
prototyping (e.g., through having one select the correct wire orientation for each
LED) had a significantly negative effect on design performance, as evidenced by a
decrease in prototype quantity, self-efficacy, and creative feeling for participants in
Group B. It appears that repeated exposure to technical interruptions while
prototyping can create a cumulative barrier to one’s design creativity. The act of
wiring individual LEDs diverted time away from the task of creating, which was
seen through the reduced quantity of ideas generated by Group B. However, the
increase in cognitive flow observed in Group A (the group presented with a
consistent wiring orientation for each LED) suggests that exposure to technical
choices that require minimal cognitive interruption contributes to a state of flow,
which corresponded to an increase in creative feeling and self-efficacy. The find-
ings from this study underscore the importance of increasing the modularity of
technology design tools to encourage playful prototyping and creative expression.
While this pilot study shows promising results with a software simulation and
crowd-sourced participants, future work will focus on matched cohort testing
with physical parts, rather than simulated circuits. Also, we aim to evaluate how
modularity impacts objective measures (i.e., prototype quantity, accuracy, and
functionality) associated with the designs produced.
Building Blocks of the Maker Movement: Modularity Enhances Creative. . .              153
Fig. 9 The Bloctopus physical prototyping toolkit: an example of a modular electronics and
software toolkit (Sadler et al. 2015a, b)
This research highlights that modularity increases the novice’s confidence to create
prototypes in a constrained domain. While the prototyping task was virtual, we
expect to find comparable results when adapted for physical toolkits, and this is the
focus of ongoing work. The Bloctopus physical prototyping toolkit of Sadler
et al. (2015a, b) shows an initial technical implementation of a plug-and-play
electronics system that prioritizes black-box modular interactions, and interfacing
with physical LEGO blocks (Fig. 9). Future work is exploring how to overcome the
functionality ceiling that is linked with a black-box module approach (Sadler
et al. 2015a, b). This ongoing work has highlighted that both the software and
electrical domain can benefit from increased modularity. We see such integrated
hardware-software modular interactions as the first step in a more universally
accessible prototyping toolkit.
7 Conclusion
creative benefits on prototyping tasks. Specifically, the results show that increasing
the modularity of design tools allowed participants to create more prototypes, and
significantly increased designers’ feelings of creativity, self-efficacy and cognitive
flow. This study lays the groundwork for an understanding of how modular tools
amplify creative prototyping.
References
Anderson C (2014) Makers: the new industrial revolution. Random House, New York
Baldwin CY, Clark KB (2000) Design rules: the power of modularity, vol 1. MIT Press,
   Cambridge
Bandura A (1977) Self-efficacy: toward a unifying theory of behavior change. Psychol Rev 84
   (2):191–215
Csikszentmihalyi M, Csikszentmihalyi IS (1992) Optimal experience: psychological studies of
   flow in consciousness. Cambridge university press, Cambridge
Dow SP, Glassco A, Kass J, Schwarz M, Schwartz DL, Klemmer SR (2010) Parallel prototyping
   leads to better design results, more divergence, and increased self-efficacy. ACM Trans
   Comput Hum Interact 17(4):18
Engeser S, Rheinberg F (2008) Flow, performance and moderators of challenge-skill balance.
   Motiv Emot 32(3):158–172
Kulkarni C, Dow SP, Klemmer SR (2014) Early and repeated exposure to examples improves
   creative work. In: Plattner H, Leifer L, Meinel C (eds) Design thinking research. Springer,
   Heidelberg, pp 49–62
Hartmann B, Doorley S, Klemmer S (2008) Hacking, mashing, gluing: understanding opportu-
   nistic design. IEEE Perv Comput 7(3):46–54
Hartmann B, Klemmer SR, Bernstein M (2005) d. tools: integrated prototyping for physical
   interaction design. IEEE Pervasive Computing
Marsh RL, Landau JD, Hicks JL (1996) How examples may (and may not) constrain creativity.
   Mem Cogn 24(5):669–680
Miller GA (1956) The magical number seven, plus or minus two: some limits on our capacity for
   processing information. Psychol Rev 63(2):81
Pimmler TU, Eppinger SD (1994) Integration analysis of product decompositions. Alfred P. Sloan
   School of Management, Massachusetts Institute of Technology, Cambridge
Sadler J, Durfee K, Shluzas L, Blikstein P (2015a). Bloctopus: a novice modular sensor system for
   playful prototyping. In: Proceedings of the ninth international conference on tangible, embed-
   ded, and embodied interaction. ACM, pp 347–354
Sadler J, Shluzas L, Blikstein P, Katila R (2015b) Creative chunking: modularity increases
   prototyping quantity, creative self-efficacy and cognitive flow. In DS79: Proceedings of the
   third international conference on design creativity, Indian Institute of Science, Bangalore
Tierney P, Farmer SM (2002) Creative self-efficacy: its potential antecedents and relationship to
   creative performance. Acad Manage J 45(6):1137–1148
                  Part III
Measuring Design Thinking
Measuring the Impact of Design Thinking
Abstract This article focuses on how organizations measure the impact of design
thinking. The results are based on a quantitative survey that is complemented by
qualitative interviews with experienced design thinkers. Even though a majority of
respondents perceive some kind of impact, only a minority has tried to determine
the impact in some way. Those who do not evaluate the impact, often do not know
how or lack the necessary resources. The metrics of those who do measure design
thinking’s impact vary considerably, but customer feedback and satisfaction is a
recurring theme. We propose that the traditional means of performance measure-
ments are often ill-suited for evaluating the impact of design thinking. We conclude
with a promising industry example of how traditional measures can be used to
gauge overall performance and how a story-based approach can capture the role of
design thinking.
1 Introduction
   Keeping in mind the wide variety of organizations with their different goals and
definitions of success, we wondered what practices exist in the hope they would spark a
fruitful discussion as well as inspire those who are in search for their own measurements.
   The results from our survey and interviews indicate that traditional means of
performance measurements are often ill-suited for evaluating the impact of design
thinking. We conclude with a promising industry example of using traditional
measures to gauge overall performance and a story-based approach to capture the
role of design thinking in the (innovation) process.
1
  The data presented in this chapter is part of a larger study on the situation of design thinking in
organizations and can be accessed here: http://thisisdesignthinking.net
Measuring the Impact of Design Thinking                                                         159
case they were not applicable or required information that participants did not want/
were not allowed to give. Allowing participants to skip questions entails varying
numbers of respondents for each question. For better readability, we will refer to the
sample as a whole as N and to all subsamples as n without further distinction. The n
refer to the subsample described in the respective paragraph.
   The survey was conducted with an online questionnaire designed with the
research software Qualtrics.2 Pretests were conducted using concurrent and retro-
spective think-aloud techniques and probing: In two iterative phases the question-
naire was tested by design thinking experts with regard to comprehensibility and
coherence of the questions.
Questionnaire Sample We invited managers and employees in organizations all
over the world who applied design thinking to participate. E-Mail lists, social media
(such as LinkedIn groups, Twitter, facebook groups) and existing contacts were
the main channels to recruit the convenience sample. A total of N ¼ 403 people
answered our call in the first half of 2014.
   About half of the respondents were managers of teams or organizations (51 %,
n ¼ 235), the remainder were design thinking team members.
   Most of our respondents were from profit-oriented organizations (65 %,
n ¼ 219,3 see Fig. 1). Another 16 % were from non-profit organizations, 9 %
from governmental organizations, and 2 % from public-private partnerships
(PPP). Most of the 8 % that make up the none of the above category were
universities, freelancers, consultancies and social businesses.
   Organization size was recorded in terms of the number of employees (European
Commission 2003). 26 % (n ¼ 118 (see footnote 3), see Fig. 2) indicated working in
a ‘micro’ organization with 1–9 employees, another 26 % in small organizations
(10–49 employees), 12 % came from medium-sized organizations (50–250
employees), 36 % from large organizations (250 employees). The 36 % of large
organizations comprises 23 % of organizations with 250–9999 employees, and
13 % with more than 10,000 employees.
2
    http://www.qualtrics.com
3
    If two or more participants were from the same organization, that organization was counted once.
160                                                                           J. Schmiedgen et al.
4
 The strong representation of information and communication organizations might be due to the
fact that many of our survey participants have or have had ties to the IT Systems Engineering
division of Hasso Plattner Institute and therefore an immanent IT focus in their work. Another
explanation could be that the IT sector faces high innovation pressure caused by digitization and
user empowerment. From experience we know that design thinking heavily resonates in those (ICT)
environments. Therefore it may not be surprising that the many responses from the IT sector are
representative of its role at the forefront of organizations that have to react faster than others.
Measuring the Impact of Design Thinking                                          161
5.   a mindset,
6.   a toolbox for user research and group creativity,
7.   prototyping, or
8.   a culture.
   Analyzing the differences and nuances of design thinking definitions deserves a
book chapter on its own (Lindberg et al. 2010), so we will limit this insight to
mentioning the most common themes at this point. These themes illustrate that
design thinking is understood and practiced in a multitude of ways in organizations.
As we will see, the definitions are just as multifaceted as the approaches to
measuring them. The fact that so many different definitions exist challenges the
idea of a general design thinking standard of measurement.
5
  For better readability we corrected obvious misspellings and adjusted initial capital/lowercase
letters without comment according to the Chicago Manual of Style (2010). Additionally, we
converted capital to lowercase letters within quoted passages for general terms and references to
bodies or concepts, also without comment. Many of our respondents were not native speakers of
English. Therefore, their statements might not always correspond to Standard English.
Measuring the Impact of Design Thinking                                            163
Too Little Experience Another group of participants was not yet able to define
any measurements, because they have had “too little experience with DT so far” or
it was “too early” to measure it respectively. One respondent further assumed it is a
risk to bring in new parameters at an early stage: “It is still so young [new], and the
innovation departments are reluctant to strangle themselves with further KPIs.”
Lack of Resources A third group of respondents wishes to measure the impact,
but has a lack of resources. As one participant put it, “Our organizational unit does
not pay adequate attention to KPIs, so this lack of tracking success is not specific to
design thinking.” Another team tries “[. . .] to keep up with the status quo of the
projects, but usually after finishing the user-centered part of it, it is unfortunately
not up to us anymore”. Some also “don’t have time for it” or have “[. . .] no time and
money”.
Flying Under the Radar Some respondents ‘fly under the radar’ of their manage-
ment. Measuring the impact of design thinking would require a “mandate from
managers”, but “design thinking is not a fully supported approach by the manage-
ment in improving the working efficiency” or “DT has never been introduced
formally.”
Working Culture For those who consider it part of their working culture, one
respondent stated that the “benefits of design thinking in our organization are in
most cases intangible” and another respondent wondered: “How would you mea-
sure enhanced teamwork?” Design thinking was also found hard to measure since
there are “too many soft facts involved.” Furthermore, one said “it is too much
about feelings, soft-skills and culture and hard to measure.”
Unnecessary or Impossible to Measure Then again, not everybody is convinced
of the necessity to measure the impact of design thinking at all, because it is just
“not needed yet,” or they have “not thought about it yet.” One person simply said:
“Why should we?” while another is “not a fan of measuring.” Measuring design
thinking’s impact is also of little importance when it “is not viewed as a ‘real’
process”, but rather considered to be “[. . .] an unproven theory.” Others consider it
impossible to measure, because it “is a mindset[,] either you have it or not” or “it’s
one method between others.”
Surprisingly, quite a few people first negated measuring the impact of design
thinking. But when asked “why not?” they mentioned actions that indicated mea-
surements of all sorts, even if rudimentary. After answering ‘no’ to the question:
“Do you measure the success of design thinking in your organization?”, one
respondent tacks on a measurement almost as an afterthought: “We are the first
164                                                                      J. Schmiedgen et al.
company in our market applying design thinking. So first we encourage and educate
the market. We measure the results and feedbacks from design thinking related
events organized by us [. . .]”.
   This respondent is not alone, as the following examples illustrate:
   “Why don’t [emphasis added] you measure the success [of design thinking]?”
• “How? Of course, we get feedback from our clients, and track their develop-
  ment, but other than that?”
• “Because our department is measuring it by business cases that are generated by
  it, but not the application of it by itself.”
• “We don’t know how to measure it in terms of KPIs. We collect feedback, on
  whether we BELIEVE it worth-while investing all the love and resources that we
  invest, and then adapt accordingly [. . .].”
• “It does not really make sense, but we collect good examples and cases to show
  to others.”
   The mindset of these responses suggests a rather paradoxical situation. It seems
that some teams have found measurements, in the broadest sense, they deem useful
(since they are in use), yet found themselves unable to elevate these same tools to
the status of “measurement-hood.” At the same time they are lacking in what they
might consider a valid measurement. Do they not trust their own judgment? It
seems challenging to people to define factors that are important for assessing the
effect of design thinking.
Of those 40 participants who indicated that they do measure the impact of design
thinking, 23 participants provided details into the ‘how’. A few answers hinted at
operationalizations (such as “stunning ideas per hour”) and many seemed to rely on
reflection.6
Customer Feedback One theme that recurrently emerged was customer feedback
and satisfaction. Knowledge was gathered “internally: customer centricity score
[. . .]; externally: net promoter score, KPIs of units, customer satisfaction [. . .]” or
with “client feedback” as well as “responses to campaigns” and “[. . .] brand value
perception from the user’s perspective.” Others focused on “usability metrics, how
[design thinking] affects the user experience” and “usability measurements in terms
of efficiency, effectiveness and satisfaction, customer validation[,] interview
results, number of live customers.” One respondent even commented: “In my
6
  Unfortunately, some measurements were described in terms so vague that they were impossible
to interpret.
Measuring the Impact of Design Thinking                                          165
experience KPIs are meant to deliver exactly the message the measuring party
wants to deliver. I prefer customer/user feedback and testimonials.” These quotes
show that the user feedback is a common evaluation tool for design thinking.
Consistent with research on empathy with the user (K€oppen and Meinel 2015),
Wendy Castleman from Intuit emphasized the importance of the user as well. She
said: “I would say the [companies] who have any success are focused on their
customers. I do talk to companies frequently. They are interested in the innovations.
They want things that are going to be successful and bring them to a new level. But
if they don’t actually care about their customer experience it isn’t [going to] work.
Because in order to really be revolutionary and different and inspirational in things
that people buy, it’s [got to] solve their problem, right?”
Design Thinking Activities Some respondents measured ‘design thinking activi-
ties’ such as “[number] of projects, trainings, trained people, junior coaches”, “how
many employees are formally trained”, or the “number of coaches, number of
projects”.
‘Immediate’ Results Some participants went one step further by measuring the
‘immediate’ results of their design thinking projects, manifested in: the “number
of transferred innovation to development,” number of “concepts finish[ed] in
actual strategic plans,” and “outcome of DT projects, e.g., launch of application.”
One participant mentioned the “number of projects that get second round
funding”, yet adds: “It’s a bit manufactured [fabricated], but since we need a
KPI. . .”
Traditional KPIs Those who took a closer look on more traditional KPIs focused
on “financial performance,” “market success,” “success and revenue of projects
initiated by DT,” and “the ROI on each project undertaken,” and “increased sales.”
Reflective Measurements Questionnaires and other reflective measurements are
not uncommon, taking the form of “evaluation forms”, “survey[s] of depth of
knowledge and application of process and tools in the organisation, qualitative
feedback during and after projects” or “questionnaires after each DT process step.”
Working Culture A few directed their attention to the impact on working culture
and measured “motivation, effectiveness, team collaboration [and] engagement”.
When so many of our survey respondents seemed at a loss as how to measure the
impact of design thinking, they touched upon an important topic. The problem with
creating metrics for design thinking in the context of a larger innovation effort can
166                                                                            J. Schmiedgen et al.
When asked how the impact of design thinking was measured, two participants
responded by “success stories” and “we collect good examples and cases to show to
others”. What does this mean?
   We spoke with Kaaren Hanson, Vice President of Design at the time, and Wendy
Castleman, head of the ‘catalyst program’8 from Intuit, a software provider of
business and financial management solutions for small and mid-sized businesses.
They provided us with details on their version of a story-based approach. They
internally showcase success stories in books produced each fiscal year. Each story
enables the reader to trace back case-specific measurements, which often emerge
alongside the individual project.
   For Intuit, isolating and quantifying the specific impact of design thinking has
turned out to be impossible because “it has just permeated what we do and how we
do it now. So, [. . . design thinking] becomes such a part of the company [that] you
can’t separate it out and measure it per se.” They went so far as to conclude that “the
7
  The butterfly effect is a concept stemming from chaos theory. It postulates that small events in a
complex system can have large, widespread consequences. In hindsight, it is nearly impossible to
isolate specific causes (Hilborn 2004).
8
  The catalyst program allows employees with regular jobs but trained in design thinking—so-
called innovation catalysts—to spend 10 % of their working hours to coach workshops.
Measuring the Impact of Design Thinking                                                167
way we look [at] success is not so much about metrics. There are so many things
that can confound a metric.”
   Even though they accepted that quantifying the impact was a futile endeavor,
they still strived to capture the role design thinking played in their work. So instead,
they chose to take a qualitative approach. Now they create stories that can be shared
within the organization and allow other departments the opportunity to see and
understand the impact of design thinking more easily and share the success on a
different level: “[. . .] I think stories are in fact the most impactful [. . .]. Anecdotes
are powerful.”
   Yet the cases are more than a mere collection of anecdotes. Every single one
contains the important measurements the company cares about like revenue, cost,
profit, employee engagement, and customer engagement. But measurements need a
context for people to relate to—so they wrap them into stories.
   Before developing their story-based approach, Intuit went through several stages
of measuring that sounded very similar to those our survey respondents described.
They started off by measuring activities (“How many people are doing this?”) and
examining several outcomes like increased revenue, decreased cost, increased
employee engagement, or increased customer experience. Wendy suggests: “First
measure activity, then start to measure impact with whatever your company cares
about.” But the types of measurement they start with do not always remain
the same.
   For Intuit, design thinking has a profound impact on the choice of appropriate
types of measurement. When exploring the problem at hand (“problem reframing”),
initial metrics can become obsolete. One key to Intuit’s successful practice is
allowing new measurements to emerge during the process. Kaaren experienced
that “often people want a measure[ment] from you that you can’t provide them
with. But you can give them something else.” She remembered one case where “the
metric that mattered to the team when they started and what they were asking for
was ‘increase conversion’. The metric they ended up getting (e.g., customer and
employee engagement) was something completely different. But everything felt
much better about it.” Additionally, allowing new measurements to emerge can also
lead back to initial metrics, because in the example above “ultimately [. . .] it
increased conversion!”
   Apart from allowing measurements to emerge during a project, Wendy offered
another piece of advice: “People ask about our innovation program and design
thinking all the time. Often they are just interested in the innovations. They want
things that are going to be successful, which brings them to a new level [. . .]. But if
they don’t actually care about their customer experience it isn’t [going to] work. [. . .
You] have to understand what your company culture values and make sure you
align to that first!”
   The way Intuit solved their problem struck us as remarkable because it contained
elements from all the other insights. It describes the difficulties of quantifying the
impact of design thinking, especially the more it is integrated into the entire
organizational culture. Over the years Intuit tried various ways to measure the
impact of design thinking. Finally, they reached the conclusion that isolating a
168                                                               J. Schmiedgen et al.
quantifiable impact was impossible. With their story-based approach they found a
balance between quantitative measurements for overall projects and qualitative
ones to describe the role of design thinking in the innovation process.
9 Conclusion
     measurements are still valid. Our interview partners at Intuit prompted the
     following example: One of their KPIs is the net promoter score (NPS) to
     measure client satisfaction. The score represents the ratio between so-called
     promoters (who indicate that they are very likely to recommend your company/
     brand/product X to a friend/colleague/relative) and so-called demoters (who
     indicate that they are not likely to make a recommendation). Picture the follow-
     ing situation: An organization just made major improvements to their product
     and raised the price when a financial crisis hit. What would be likely to happen?
     Customers complain that the product is too expensive, so they are less likely to
     recommend it to someone else. As a consequence, the NPS goes down—but does
     that mean they should not have improved their product? Most likely, the NPS
     gets a dent for a while but improves in the long run—when people start
     appreciating the improvements. So unless a metric is extremely robust, relying
     on a measurement without considering the bigger picture is risky, sometimes
     even harmful.
   Most of our respondents were from for-profit companies, so many of our
examples are too. However, hopefully for-profits and non-profits alike (and all of
those in between) will find useful information on measuring the impact of design
thinking in an organizational setting.
Acknowledgments All results are part of a survey on the impact of Design Thinking in organi-
zations. We would like to thank Prof. Hasso Plattner and the HPI-Stanford Design Thinking
Research Program Committee for their support. We thank Dr. Sharon Nemeth for her editorial
support.
9
  Questions based on concrete action could be: “Do you record the number of design thinking
projects?” or “Do you collect customer feedback?”
170                                                                             J. Schmiedgen et al.
References
Abstract The creative behaviors that underpin design thinking are difficult to
measure. This is problematic because people who have a desire to practice design
thinking in an organizational context are often assessed only on their ability to
execute via traditional metrics. Therefore they have less incentive to work in a
creative way. In order for organizations to fully support and incentivize design
thinking, they must measure creative behaviors as much as they do executional
behaviors. This chapter highlights a suite of initial metrics that arose from research
on d.school alumni and organizations applying design thinking as a core driver of
their innovation strategy.
1 Introduction
What Gets Measured Gets Done This popular business adage reveals a barrier
many people face when applying design thinking. Most organizations rely on
efficiency and productivity to succeed. As such, measurement is geared towards
identifying and rewarding employee behaviors that support these execution-
oriented goals. Most organizations that turn to design thinking as a methodology
for innovation know the process will be more exploratory and less focused on
execution. This means that they need to identity and reward different employee
behaviors—more creative behaviors—that support creativity-oriented goals. How-
ever, the vast majority of organizations continue to utilize execution-oriented
measures without adding any creativity-oriented measures. This results in two
major problems. The first is that most organizations lean on metrics, at least in
part, to determine the success of new initiatives. If design thinking cannot demon-
strate success in a measurable way, it may be abandoned for other innovation
methodologies that are easier to track (Nussbaum 2011). The second major problem
2.1 Background
One of the fundamental goals of the d.school is to instill creative confidence in its
students (Kelley and Kelley 2013). But what does this really mean? Can we define it
in more precise terms? Because this is regarded as an internal construct, it makes
sense to turn to the field of psychology for guidance.
    Creative confidence is often linked to the notion of self-efficacy developed by
Albert Bandura (Bandura 1977). He defines it as people’s beliefs about their
capabilities to produce designated levels of performance that exercise influence
over events that affect their lives (Bandura 1994). Most psychologists believe that
confidence is a different construct than self-efficacy (Schunk 1991). Confidence is a
general belief in one’s self, whereas, self-efficacy is more subject matter specific.
For example, someone may have high self-efficacy in music but low self-efficacy in
math. That same person may also have a high or low general self-confidence. This
begs the question, is creativity a subject or something larger?
    Even if we accept creativity as a subject, and use the term creative self-efficacy,
it still is not clear that creative confidence is the same thing as creative self-efficacy.
Self-efficacy might be an important aspect but not completely equivalent. Another
Developing Design Thinking Metrics as a Driver of Creative Innovation                 173
goal of the d.school is to help students apply design thinking in real world contexts.
In fact, that is the basis behind the executive education program. Given this,
creative self-efficacy seems to provide an incomplete definition. After all, if one
has all the belief in the world about their creativity but never applies it, the d.school
probably would not say that person has a lot of creative confidence. That is why we
developed creative agency, the ability to apply one’s creativity, as a complement to
creative self-efficacy (Royalty et al. 2012). We believe the two together make up
creative confidence.
    Creative agency was developed and turned into a scale in previous HPDTRP
projects (Royalty et al. 2014). It consistently demonstrated a significant change in
people’s dispositions before and after design thinking learning interventions. How-
ever, we never tested it with a sufficiently large control group. This year we
identified two separate control groups for a quantitative, survey-based study. The
treatment group was comprised of students who applied and enrolled in the
10-week introduction to design thinking “Bootcamp” course taught at the d.school
during the fall quarter. One control group we identified was the set of all students
who applied to the Bootcamp course but were not admitted. In most cases, the
students not accepted were not necessarily less qualified; there simply were not
enough available spots in the course. The other control group was made up of
students who took a more traditional product design course on innovation and
design. This course is also in the school of engineering and helps students generate
new solutions to address an unmet need. The instructors, however, teach a different
process than the d.school. The course is taught every quarter of the academic year,
but not every student takes all three quarters. We only surveyed students from the
fall quarter because that matched up with the fall quarter Bootcamp course and
because the final two quarters have more of an implementation focus, which
extends beyond the scope of most d.school courses.
    In addition to capturing creative agency, we also surveyed for creative growth
mindset. This is an adaptation of Carol Dweck’s work on mindsets (Dweck 2006).
She has shown that people who believe that intelligence is malleable (growth
mindset) tend to perform better than people who believe that intelligence is static
(fixed mindset). The survey used to detect a growth mindset is well established and
has been modified to measure growth mindsets in other areas. However, there is no
research yet using a modification of this survey to measure a creative growth
mindset.
    This construct is interesting for at least two reasons. One is that it can potentially
shed some light on the people who choose to apply for d.school courses. One might
expect people who applied for the Bootcamp course to have a more malleable view
of creativity then those who took the Smart Products course and did not apply to the
Bootcamp course. In other words, they believe they can enhance their creativity and
that is a reason why they take a d.school course. The second interesting point is that
a creative growth mindset may initially not be very high in d.school course
applicants but may increase during the course of their experience. In this case, we
would expect to see a change in the pre and posttest for students who took
Bootcamp.
174                                                             A. Royalty and B. Roth
    It is important to note that the psychological constructs listed above may very
well affect how people continue to learn and apply design thinking. Self-efficacy
has been shown to lead to increased motivation and persistence (Schunk 1991).
These traits are especially important for students who graduate and struggle to use
design thinking in a real world context. In most cases, organizations have not
learned how to support design thinking. If we boost graduates’ creative self-
efficacy, they should be more resilient as they attempt to practice this methodology
in companies that are not as nurturing an environment as the d.school. This is
important because the success of design thinking relies on people applying it
beyond the d.school.
    Similarly, people with growth mindsets respond better to failure (Dweck 2000).
They see it as an opportunity to improve rather than a judgment of their character.
That suggests that the higher the creative growth mindset, the more likely alumni
are to continue to leverage their creativity in the face of failure.
2.2 Materials
We created both an online and paper based survey. The creative agency scale is
comprised of 11 five-point Likert items. There were 3 six-point Likert items on the
creative growth mindset scale. The remainder of the survey featured demographic
questions.
2.3 Procedure
A link to the online version of the survey was added to the end of the Bootcamp
course’s web-based application. Over 120 applicants completed the survey. The
Bootcamp teaching team admitted 33 students who completed the survey. On the
final day of class (the final day of the fall quarter), the students were given the
paper-based version of the survey. This yielded 31 Bootcamp students who took
both the pre and the post survey. The online version was emailed to all the
applicants who took the pre survey but were not admitted into the course. A total
of 51 subjects responded. None of them enrolled in any d.school course during the
fall quarter. This is not surprising because there are very few other d.school courses
taught in the fall.
   The paper-based version of the survey was given during the first and last weeks
of the fall quarter of a product design course. This course covers topics related to
innovation but does so using a more traditional educational model. From that course
31 students out of approximately 45 enrolled completed both the pre and post
survey.
   At the end of the 2013–2014 academic year, an online version of the survey was
sent to subjects in all three conditions who completed both the pre and post survey.
Developing Design Thinking Metrics as a Driver of Creative Innovation                      175
   The creative agency scale captures people’s confidence in applying their own
creativity. The average pre and post survey response of subjects in the three
different conditions is listed in Table 1.
   The results of the end-of-year survey are still under analysis. One of the major
questions is if creative agency is sustained months after completion of a d.school
course. This would fit with previous results (Hawthorne et al. 2014). However, it is
not yet know how many respondents in any of the conditions enrolled in d.school
courses during the winter and/or spring quarters. It may be the case that we will also
have enough data to explore the effect taking multiple courses has on creative
agency changes.
   The creative growth mindset measure computes a score between 3 and 15. There
was little change between pre and post survey averages for the three conditions.
Additionally, there was virtually no difference in the averages between students
who took Bootcamp and those who did not. However, students who applied to
Bootcamp average a point higher than those who took the traditional product design
course; 13–12.
2.4 Discussion
The survey analysis suggests that taking Bootcamp did have a significant effect on
students’ creative agency. It is also interesting to note that none of the pretest
averages were significantly different from one another. This indicates that creative
agency is not a predictor of who is going to apply for a d.school course.
   It appears that creative growth mindset does not significantly change in any
condition. Though this might be due to a ceiling affect as most students exhibited
moderate or high growth mindset. Still the applicants to the Bootcamp course did
have a higher creative growth mindset. This is not completely surprising as one
would assume that students taking a d.school course would be doing so in part to
work on their own creative capacity. Still having a high creative growth mindset in
of itself does not lead to an increase in creative agency. It might be the case that a
high creative growth mindset is a necessary but not sufficient requirement for
boosting creative agency. A next step would be to investigate the creative agency
changes in Bootcamp students with low initial creative growth mindset.
176                                                                A. Royalty and B. Roth
3.1 Background
while training 10–12 associates from each company in mixed teams. This session
was hosted by the financial firm in April. The retail firm will host the next session.
An emergent goal is to find a large problem that the four-company collaboration can
work on together using design thinking. This is in line with one of the original
promises of design thinking to solve large, complex problems (Cross 1990).
Although this collaboration did not change our fundamental research question, it
did provide an additional opportunity as we were asked to help them measure the
impact of this collaboration in addition to the impact of design thinking in each
organization.
3.2 Procedure
We began this project by addressing our second design principle; understanding the
goals each company has for using design thinking. Although each organization is a
different context for design, we decided that because they have enough in common
to work together, there might be patterns across the organizational goals that we
could leverage to design measurement tools for all four to use. Two different
interventions led to our understanding of these organization goals.
   The first intervention was eight semi-structured interviews of innovation cata-
lysts. We asked them why their companies began using design thinking and how
that has evolved over time. Essentially we asked them about the goals of the
organization. The second came from 17 reflection sheets designed to uncover
how innovation catalysts teach design thinking (see Fig. 1). Each sheet is a blank
timeline of a given employee’s journey through a design thinking training.
The innovation catalysts filled in the journey of learning design thinking they
attempt to take their colleagues through. The idea is that the goals of the design
thinking trainings should align with, or support, the design thinking goals of the
organization.
    Using these data, the initial analysis of which is included in the following
section, we discovered general categories to measure. From there we worked with
the innovation catalysts to modify existing measures and try new ones that we
mutually felt were beneficial to the companies’ innovation goals.
    Our aim was to develop measures that are quantitative in nature and output to an
ordered variable. This is important because we want the ability to rate teams as
high, medium, or low. This is not to say that there should be no qualitative
component. In fact we believe that qualitative measures could—and should—be
added, just not relied on exclusively. Finally, these measure are meant to be easily
captured throughout the course of a single project. Once a new project starts, the
measures reset.
    One important thing to note is that these measures are not necessarily meant for
all teams in an organization, just the teams slated to use design thinking. There are a
number of projects where teams implement traditional methodologies and success-
fully achieve their non-design thinking related goals.
3.3 Results
Three large themes around why these companies are turning to design thinking
immediately immerged:
1. A company wide disconnect from the end user.
2. Fear of a startup taking future business.
3. Desire for teams to work in a more innovative way.
   Each organization feels that in one way or another, their project teams do not
understand what their customers want or need. Here is an example from an
innovation catalyst: “people don’t trust financial institutions after 2008. Even
though [we aren’t] the one to blame, they don’t trust us. We need to understand
what is behind that distrust in order to better relate to our customers.”
   Fear is an unexpectedly powerful motivator for design thinking. Two of the most
senior innovation catalysts shared that their organization is afraid of newer, more
nimble companies cornering the market by discovering the next big breakthroughs
and taking all of their business. They believe that entrepreneurs are more likely to
identify opportunities and successfully take risks.
   Finally there is a desire for teams to collaborate in a more nibble and entrepre-
neurial way. Numerous reflection sheets indicated that one of the training goals is to
teach employees the ability to flare and focus as part of the working process.
Apparently focusing is much more common than flaring.
Developing Design Thinking Metrics as a Driver of Creative Innovation             179
    These three insights inspired four categories of measures. The categories are
linked to design thinking principles that the four organizations adhere to. Below is a
list of the measures with a brief description.
The degree to which teams are connected to their end users can be captured by the
amount of meaningful contact they have with their customers. The following
measures aim to capture how close teams are to users.
Measure 1E The number of days gone without interacting with a customer. An
innovation catalysts has teams keep a running total of the number of days they have
gone without interacting with a customer (either conducting an interview, an
observation, or testing a prototype). As soon as they interact with a user, they can
reset that number to zero. This measure captures the duration the team goes without
user input. Alternatively, it would be possible to capture the percent of days teams
interact with a customer, but we feel the former is stronger because it provides
stronger formative feedback.
Measure 2E The number of users spoken with. The team keeps a running list of
the people they interview or test a prototype with over the course of the project.
This simply shows the amount of empathy being done. If one looks at empathy as a
source of ideas, then this translates into fluency (Runco 1999). The distribution of
user engagement could also be interesting. A team might start with a lot of different
empathy subjects in the beginning and then hone down to few users at the end. That
is why it is important to capture engagements during the entire duration of a project.
This measure is a case where qualitative data, namely what was learned from each
user, could be very beneficial to the team (and researchers).
Measure 3E The number of categories of people spoken with. The team creates a
list of persona types that they interact with during the project. The teams generate
the categories themselves. An example of such categories might be; single mothers,
elderly couples, unemployed millennials, etc. This is valuable because it indicates
how diverse their human centered work is. Connecting with customers means
connecting with both existing and future customers, something this measure pro-
motes. Again looking at empathy as a type of idea generation, this could be seen as
flexibility (Guilford 1968). As with measure 2E, capturing qualitative data about
the learnings from each category is highly encouraged.
3.3.2 Reframing
The ability to identify new and valuable opportunity spaces is a major component of
innovation. Startups do this well in part because they are nimble enough to
constantly change the problem they are working on.
180                                                              A. Royalty and B. Roth
Measure 1R The novelty/value grid is a tool that classifies the project objective.
Because the objective is expected to evolve as the project advances, this tool should
be used at regular intervals. The grid is comprised of two axes. The horizontal axis
is how novel this objective is. Has a project like this been worked on before? Does it
take the company into a new space? This doesn’t necessarily mean the creation of a
new good or service. It could also represent repurposing a solution in a novel
context (Amabile 1996). The key feature is that it extends the company into a
new space. The vertical axis is perceived value. How valuable is this objective to
the company? Individual team members plot the project objective somewhere on
the grid (see Fig. 2). This generates both a novelty component and value component
score from each team member. An aggregate score for each axis can be calculated
by taking an average or a weighted average across all respondents. Employees may
feel uncomfortable rating the value of their project, so this measure is intended to be
filled out anonymously.
3.3.3 Iteration
The desire for teams to work in innovative ways led to two categories. The first is
iteration, or how robust their prototyping process is.
Measure 1I The number of prototype iterations. This measure captures the num-
ber of iterations performed by each individual or small group. All four organiza-
tions are pushing for more prototyping. In general, frequent iterating has been
shown to lead to stronger prototypes (Dow and Klemmer 2011). Depending on
the project, it might make more sense to capture the number of prototypes per
feature. For example, one team may work on a small app while another may work
on an entire website. The latter team then has more opportunity to iterate because
there are more features. In that case it would make more sense to calculate iterations
per feature. To capture this, team members simply list each iteration they create and
what they hope to learn from it.
Developing Design Thinking Metrics as a Driver of Creative Innovation                 181
The final measure captures how well teams are working using design thinking. For
this measure we turn to the Interaction Dynamics Notation tool created by Neeraj
Sonalkar and Ade Mabogunje. Although this work is still developing, it would be
possible for the teams to capture video of themselves and send it in for analysis.
   Each measure, with the exception of the Interaction Dynamics Notation, has
been developed in tandem with the innovation catalysts. Although no single
measure is perfect, we believe that a suite of measures can identify the difference
between teams using a strong design thinking process and teams using a week
design thinking process.
   Because these companies are located across the United States, it is not feasible
for a research team to administer the measures. Therefore, the innovation catalysts
will ultimately be responsible for distributing the measures and collecting the
results. Fortunately they are motivated to participate since showing the impact of
design thinking is a major part of their job. A part many have struggled with up
until now.
   Before the initial trial teams are selected, we will interview the innovation
catalysis and ask them to map out the design thinking ecosystem in their organiza-
tion. Besides providing interesting structural information about how design think-
ing fits in their organizations, it will help us locate teams to test with. Our goal is to
prototype the measures on teams that have both a high and low amount of comfort
with design thinking.
3.4 Discussion
We hope to answer a few major questions with this trial. The first is if the tools are
easy and short enough for teams to complete. The second is whether or not the
formative evaluation of teams allows the innovation catalysts to provide better
support. The final question is what form factor makes these measures most effec-
tive. Should they be digital or paper based? Are they part of a large dashboard that
fits on a wall above a whiteboard? We should have some insight into these questions
after the first tests. Once these tools are fine-tuned, the next step is to compare
teams’ process to their project outcomes.
182                                                                       A. Royalty and B. Roth
   This work is important because the very act of measuring a methodology sends
the signal that the organization values this way of working. That creates an
incentive to use design thinking. Additionally, when we begin to compare teams’
process to the outcomes we can begin to describe the real impact of design thinking
in organizations. In essence, we will be able to respond to the question, do teams
that have a strong process produce more innovative outcomes?
References
Royalty A, Ladenheim K, Roth B (2015) Assessing the development of design thinking: from
   training to organizational application. In: Plattner H, Meinel C, Leifer L (eds) Design thinking
   research. Springer, Heidelberg, pp 73–86
Runco MA (1999) Divergent thinking. In: Runco MA, Pritzker SR (eds) Encyclopedia of crea-
   tivity, vol 1. Academic, San Diego, pp 577–582
Schunk DH (1991) Self-efficacy and academic motivation. Educ Psychol 26(3–4):207–231
                                Part IV
Documentation and Information Transfer
           in Design Thinking Processes
Experience and Knowledge Transfer
Through Special Topic Coaching Sessions
Abstract Design teams have different problems and needs during their projects.
For the teams’ coaches this can amount to a high workload due to acquainting
themselves with new topics and preparing different coaching sessions for the teams.
On the other hand, former teams might have experienced similar problems, and the
experiences and solutions of members from such a team could be valuable to the
current teams. In this chapter, we present the concept of special topic coaches that
lets former members of design teams provide additional coaching sessions to
current teams. This concept allows experienced members of design teams to
share their knowledge and experiences with a current team by preparing and
running a coaching session. Thus it relieves the dedicated team coaches while
providing the team with valuable input and establishing an exchange between
different design teams. The teams can discuss their problems with peers and receive
additional input on topics important to them when needed. To ensure that our goals
are met, we implemented the concept with the help of a coaching seminar parallel to
one of our design engineering courses. The seminar was evaluated through obser-
vations and questionnaires. We will present the results as well as derived changes to
the next iteration of the seminar and our ideas of how to adopt this concept in a
company context.
1 Introduction
Former members of design teams are often used as coaches for new teams to pass on
their knowledge and previous experiences. This can be seen in design courses as
well as in companies implementing design and innovation teams. However not all
members of a design team will become design thinking coaches later on, which
often means their expertise and experiences are unreachable for current and
upcoming design teams.
   In this chapter, we present a concept that allows former members of global
design teams to prepare and execute special topic coaching sessions for current
teams. These special topic coaches also have to prepare lean documentation about
the respective topics in order to allow for future reuse. The coaches offer a variety
of topics (e.g. decision making, interview acquisition, or presentation preparation),
which are assigned to the teams in accordance with the project schedule and their
current demand.
   With this concept we aim to offer teams additional helpful coaching sessions,
while taking some of the workload from the teams’ main coaches. Additionally we
hope to improve the knowledge exchange between current and former teams and
thus establish a stronger community of design team members in general.
   We implemented our concept in the form of a seminar in which alumni of our
global design and engineering course (Carleton and Leifer 2009) take on the role of
the special topic coaches. To evaluate the quality and impact of their coaching
sessions, we qualitatively analyzed each coaching input by observing the dynamics
of the session and identifying positive and negative aspects of coaching work.
Additionally, we performed a quantitative analysis by using questionnaires that
assess how well the input was perceived by the team, how helpful it was at the
respective point in the project, or how the ratio between practice and theory was
perceived by the team. The coaching sessions are observed with minimal interfer-
ence (i.e., observers only interfere in case of wrong inputs) in order to allow for an
undistorted assessment of their quality.
   The rest of this paper is structured as follows: Sect. 2 reflects on related work in
coaching and knowledge management, Sect. 3 introduces the general concept of
special topic coaches and implementation requirements, Sect. 4 describes our
implementation of the concept in form of a parallel coaching seminar to our global
design course. The evaluation of the seminar and the coaching sessions is discussed
in Sect. 5 while Sect. 6 gives an outlook on the next iteration of the course and
concludes the paper with a summary.
2 Related Work
coaches rated roles with a higher workload for themselves as worse than students
did, e.g. the instructor, facilitator, and supervisor roles. Furthermore they found that
high-performing teams and coaches saw their main roles as being a mentor,
facilitator and supervisor while coaches of low-performing teams saw themselves
mainly as mentor. These findings indicate that teams have different coaching needs
and require coaches to follow more than one coaching style as needed. If the team-
dedicated coach has to fulfill all the coaching needs of his team he will have a
higher workload. We address this issue with our concept of special topic coaches by
introducing additional coaches, thus lowering the workload of the dedicated team
coach and satisfying the coaching needs of the team.
   Hackman and Wageman found that effective coaching requires four conditions
to be met, two of which have to do with the coaches’ actions (Hackman and
Wageman 2005):
• “Coaching behaviors focus on salient task performance processes rather than on
  members’ interpersonal relationships or on processes that are not under the
  team’s control”, and
• “Coaching interventions are made at times when the team is ready for them and
  able to deal with them”
   We disagree with the first condition, as our teams specifically asked for help with
interpersonal issues, especially with cultural difficulties. A topic that has also been
addressed by other researchers, e.g., Grimheden et al. introduced special culture
coaches who served as ambassadors for the “other” culture (Grimheden et al. 2006)
in a global virtual team. In our seminar one of the alumni coaches gave a coaching
session addressing the cultural difficulties while the global team was visiting. We
agree with the second condition and want to address it by giving additional
coaching sessions when the teams request them.
   Inter-project knowledge management is a complex topic that faces specific
challenges (Pretorius and Steyn 2005; Shinoda et al. 2013) due to the unique and
temporal nature of projects. Shinoda et al. list the following challenges for knowl-
edge management in project oriented environments (Shinoda et al. 2013):
1. Retention difficulty—knowledge is fragmented due to the routines and structure
   in projects, and thus projects struggle to retain ideas
2. Inefficiency—if previously generated knowledge is unavailable solutions have
   to be created again
3. Low replication—projects have higher learning curves due to their uniqueness
   and low repetition
4. New relationships—people work as a team for a time and then separate, which
   makes continuous learning difficult
   Pretorius and Steyn have identified two strategies for knowledge management in
such environments (Pretorius and Steyn 2005):
1. Codification—storing explicit knowledge in a database
2. Personalization—sharing (tacit) knowledge person-to-person
190                                                                      F. Häger et al.
   They also state that it is important to share tacit knowledge through human
interaction.
   In our annual course we have seen some of the identified challenges and we
believe that the concept of special topic coaches can help overcome these by
providing a personal exchange of such knowledge as proposed by Pretorius and
Steyn.
As previously described, different design teams have different needs with regard to
coaching. While dedicated team coaches are well prepared to give general input on
design methodologies the teams often require additional input on special topics that
arise during the project and are often project specific. Expecting the dedicated team
coaches to prepare coaching sessions on all these topics would require them to get
acquainted with completely new topics or problems in which they do not yet have
expertise. Additionally, preparing such sessions results in a much higher workload.
On the other hand, members of other design teams can be experts on the topic or
have encountered similar issues in their own projects.
   These facts have led us to create the concept of special topic coaches. The
concept introduces current design teams to a pool of experienced design team
members who are willing to provide inputs and coaching sessions on very specific
topic they are experienced in.
   We believe that by introducing members of other or former design teams as
coaches for a specific topic we can foster exchange between projects, relieve the
dedicated team coaches from some of their workload and provide design teams with
valuable input for their current needs. To achieve such a setup it is necessary to
have an overview of available coaches and their special topics and to make the
design teams aware of the possibility of getting coaching for special topics/issues.
   The first goal can be achieved, e.g. by creating profiles of people who are willing
to give such coaching sessions including their areas of expertise as well as special
issues and characteristics of the projects they have worked on. Such profiles can for
example be collected by the dedicated team coaches at the end of a project and then
be kept in an intranet or a social platform, or they could be managed by a special
coaches coordinator.
   To make the teams aware of the special coaching topics, the dedicated team
coach should use reflective sessions. This could mean, for example, finding out
where additional input is needed and which open issues exist at the end of a day or a
working phase. The team coach or a coordinator for special coaches could then
make a connection between the team and an appropriate coach, to organize a special
coaching session.
   Another important aspect to consider is how to introduce these special coaches
to general coaching practices. This can for example be achieved through workshops
or trainings with experienced coaches.
Experience and Knowledge Transfer Through Special Topic Coaching Sessions        191
   To create a valuable coaching session, the coach should take some time with the
team he will coach to learn about their project and their current problems and needs.
He can then prepare and implement one or more coaching sessions for and with the
team. In order to document the coaching for reuse by himself or other coaches the
coach should also create a lean documentation of his work.
and have the possibility to gain insight into teaching and coaching. Therefore each
of our design teams in ME310 has a dedicated alumni team coach.
   The parallel coaching seminar which implements our concept of special topic
coaches runs during the first two thirds of the global design course. It allows
additional alumni of the course to gain experiences in coaching and share their
knowledge on different topics. It also provides the current project teams with a
possibility to receive coaching on specific topics as needed and takes some of the
workload of the dedicated coaches. The seminar thus addresses the issues we hope
to solve with the concept.
   Figure 2 depicts the sequence of events of the seminar. In a first meeting, all
alumni coaches were given an introduction to general coaching guidelines and the
advisors discussed possible topics with each alumnus based on their expertise and
experiences in their own projects. In addition, the advisors spoke with the three
current design teams and discussed the issues in which they wanted to receive
special coaching sessions. Based on these possible topic collections the advisors
assigned each alumnus to one of the three teams. The coaches met with the team
they were to coach and discussed the final topic and the team’s needs with the
students. Afterwards they prepared a coaching session and a lean documentation on
the topic chosen together with the team. This lean documentation served two
purposes. First, it was a handout for the coached team to remember the most
important aspects of the session. Second, it allowed the dedicated team coaches
of the other teams to repeat the coaching session with their team if the topic should
be interesting to them. The concept and the documentation were then reviewed with
the seminar advisors, before actually coaching the team. During the actual coaching
session seminar advisors took part as silent observers.
   To evaluate the concept in general, the coaching efforts and the impact on the
teams we devised a concept of evaluating each coaching session. This concept
allows us to gain additional insights into the dynamics, and positive and negative
aspects of such coaching sessions. The evaluation concept consists of two ques-
tionnaires, one for coaches and one for the coached students and an evaluation sheet
for the teaching staff. Alumni coaches as well as the coached teams were asked to
fill out the questionnaires after their coaching session, rating the session in general,
the use of tools (e.g. handouts, whiteboards, and exercises), the ratio of theory
vs. practice, and pointing out positive and negative aspects of the coaching sessions.
The advisors filled out the evaluation sheet during the session, rating preparation,
structure, theoretical and practical components, empathy of the coach for the team
and their general impressions.
5 Evaluation
In this year’s iteration of the seminar, seven course alumni gave coaching sessions.
All the alumni were first time coaches and alumni of last year’s course. Each coach
gave a session to one of the three four-student teams. For one of the coaching
sessions the partner team was available so the session included seven students. The
topics coached included decision-making as a team process, interview acquisition,
global documentation, presenting abstract concept in a tangible manner or how to
receive honest user feedback. Depending on the topic, the coaches gave one or
multiple input sessions lasting between 30 min and 4 h and provided two to four
pages of documentation. The first results show that alumni value the possibility of
passing on their gathered knowledge by giving coaching sessions, while teams find
alumni input valuable for their ongoing projects.
Six out of seven coaches answered the questionnaire after giving their coaching
session. We asked the coaches to describe positive and negative aspects of their
coaching session, the tools they used and whether they found them helpful, and the
ratio between theory and practice.
The positive aspects of the sessions, as described by the coaches, included engaged
team members, and practical tips and personal stories from the coaches own pro-
jects as a good way to explain their recommendations. The main disadvantages
mentioned were the limited time of the team members and some issues with
executing their coaching ideas, for example: “I wish I had more experience with
regards to the balancing act between the actual prototype and the requirement of the
194                                                                       F. Häger et al.
5.1.2 Tools
When asked about tools that were used during the coaching session most coaches
mentioned the whiteboard, the handout and/or slides, and the timer. Three of the
coaches found that the whiteboard was useful to the collaborative work of the team
as the following statement describes: “The whiteboard was really useful in discus-
sions and ideation.” Two of the coaches found the documentation in the form of
handouts or slides useful for themselves as a guide but not so much for the team.
One coach described this as follows: “[The] handout was only helpful for myself,
not really for the team at the time of the coaching session.” In contrast, other
coaches found the documentation very helpful for the team as a summary and a
reminder for the future, for example: “I think the most helpful artifact, that resulted
from the coaching is the handout I prepared for the team members. By having all
important information as a rule set at hand, it will be easier to remember and act on
the advice given in the next user testing.” It can be argued that the difference here
depends on the amount of theoretical information needed for some topics as
opposed to other rather practical topics where team and coaches mostly worked
together and a summarizing document is less needed. The timer was only men-
tioned by two coaches and appeared only important for sessions with a schedule of
inputs and exercises and not so much for sessions mostly based on discussions.
   Interestingly, most coaches described tools they used with the team during the
coaching session. However one of the coaches described a tool he used by himself
alone: “I essentially prepared a full discussion agenda with all points that I would
want to bring across plus possible responses on the team’s side that I could use to
follow, skip parts that were not applicable or deep dive into others they were
interested in. That was very helpful to keep track of the content, to not diverge
too much and to always know what to say next.” This idea seems very helpful for
topics that involve longer discussions.
Naturally, different coaching topics require a different level of theory and practice.
When asked to rate the ratio between theory and practice in their coaching session
most coaches described the level as corresponding to the topic, as the following
statements reflect: “The topic being very theoretical in nature, there was only
limited practical coaching to do. I saw no need for practical coaching for this
topic.” and “The team was already aware of the goal of the theory, so I needed
less time to introduce it to them and we were able to focus more on the prototype
itself.”
Experience and Knowledge Transfer Through Special Topic Coaching Sessions           195
5.1.4 Summary
Overall the alumni valued the experience of coaching. One coach stated “I really
like the opportunity to actual[ly] coach, which is still rare in university in general.”
Another coach wrote “I like the idea of a coaching seminar. I also like, that the
ME310 alumni are still part of the ME310 family, with the coaching seminar more
than ever.” This highlights, that the seminar provides an opportunity for the alumni
to stay connected with the course creating more of a community between alumni
and current students.
During the global design course each of the three teams was coached by two or
three alumni coaches. One coaching session even involved students from the global
university who were visiting during the time of coaching. After each session, all
participating students were asked to fill out the questionnaire. We received 21 of
31 possible responses. For each coaching session at least two students answered the
questionnaire. Similar to the coaches, the students were asked to describe positive
and negative aspects of the coaching session, tools used and whether the tools were
useful and the ratio of theory and practice during the coaching session.
The major positive aspect the students saw in the coaching sessions included
obtaining input on a topic to help them with a current problem or need. For
example, students said: “(the session) structured our process of finding interview
partners,” “useful coaching which helped us to explain our project topic better” or
“we gained really good insights into testing especially with elderly people.” Neg-
ative aspects were found mostly in the timing of the coaching sessions as most
students wished to have had the sessions earlier in the project. This can be seen in
the following reflections “I would have liked the coaching a little earlier, as some
points are really valuable . . . especially pointing out some points again, that we
forgot over the time.” or “the whole coaching session was maybe a little too late in
our process and we couldn’t use as much as we would have liked to” This lateness is
due to the fact that the global design course starts about a month earlier than other
courses at our university, while coaching commenced at the beginning of the
semester.
196                                                                        F. Häger et al.
5.2.2 Tools
The student’s evaluation of tools used during the coaching session matches the
coaches’ evaluation. The whiteboard and the documentation in the form of hand-
outs or slides were most mentioned. The students said: “structuring [the] concept on
whiteboard was quite helpful” and “visualizing the different priorities for our
project [on the whiteboard] was really nice for understanding the team.” Students
therefore agree with coaches that whiteboards are a good tool for collaborative
coaching sessions. The handout was described as “pretty helpful” or “very helpful
for future reference”. Unlike the coaches, no student experienced the handout as not
being helpful. While the timer was mentioned as a tool in use, no comments were
made on its usefulness. This is probably because students don’t realize the impor-
tance of timing in cases where the coach has a good time management.
Similar to the coaches, the students found the ratio between theory and practice
well-suited for the respective topic as reflected by statements like “[It was] a good
mixture between exercises and theory.” or “[The coaching was] very practical,
which is good for this topic because I think there is not too much theory.”
5.2.4 Summary
Overall the students valued coaching from course alumni highly as it gave them
insight into former projects and provided them with additional input when they
needed it. For example, “[the coaching] was in the perfect project phase” or “the
general theme was awesome as we [as a team] have problems with it”.
Complementary to the text based questions, students and coaches were asked to rate
the coaching session in general and the ratio between theory and practice on a Likert
Scale ranging from 1—“Very good” to 5—“Not good at all”. Additionally, the
students were asked to rate how well the coaching session fit into their current process.
Figure 3 depicts the general student (S) and coach (C) ratings for the seven coaching
session in a box-and-whisker plot. As can be seen, all coaches rated their session
inside the range of student ratings. The rating of the coaches from coaching session
Experience and Knowledge Transfer Through Special Topic Coaching Sessions                                       197
         4
                                                                  3
         3                                                        X
                        2                           2                           2   2         2
         2              X                           X                           X   X         X
                                      1
         1                            X
         0
              S         C   S         C   S         C   S         C   S         C   S         C   S         C
CS1, CS3 and CS6 are within the second and third quartile of student ratings. The
rating of the coaches from coaching session CS2, CS4 and CS5 correspond with the
outliers of student ratings. For these coaches it would be recommended to reflect on
the students’ textual answers to see why most of them rated the session better or
worse than the coaches themselves.
Figure 4 shows the ratings of the theory-to-practice ratio of students and coaches for
the seven coaching sessions. As can be seen, coach and student ratings differ more
on this topic as only the coach rating from session CS4 is between the second and
third quartile. The coaches from sessions CS2, and CS5 correspond with the worse
outliers of student ratings. And the coaches from sessions CS1 and CS6 rated the
ratio worse than any of the students. In CS1 the coach would have liked to have
found some more time for theoretical inputs, while the coach from CS6 would have
liked to have had more time for a practical session with the team. Only the coach
from session CS3 rated the ratio better than most of the students. It would be
interesting to look further into the different ratings on this topic by interviewing
students and coaches again. However, we think the ratings and the associated
comments by the coaches and students indicate that the coaches have higher
expectations towards their coaching session and can’t fit everything they would
like into the time slots for coaching sessions. Conversely, the students find the
provided input sufficient.
Figure 5 depicts the rating by the students of how well the coaching session fit into
their current process. As can be seen, the student ratings are the worst here. As the
comments in Sect. 5.2.1 describe, this is due to the fact that most students would
198                                                                                                          F. Häger et al.
         5
                                                                                               4
         4                                                                                     X
                         3
         3               X
                                       2                           2             2
         2                             X                           X             X
                                                     1
         1                                           X
         0
               S         C   S         C   S         C   S         C   S         C   S         C   S         C
Fig. 4 Theory vs. practice rating of coaching sessions by coaches and students
have liked the coaching sessions to take place earlier in the project in order to make
them even more useful.
As described in Sect. 4 each coaching session was observed by one of the course
advisors. This was done for three reasons:
• To be able to grade the coaching session,
• To be able to intervene in case the coach is giving wrong input, and
• To evaluate the quality of the coaching sessions.
Experience and Knowledge Transfer Through Special Topic Coaching Sessions            199
   Overall we found that all coaches were able to gain empathy for the team from
their preparation session and managed to provide a coaching session with a useful
topic for the team. This is also reflected by the coaches and students evaluation.
Similar to the coaches themselves and the students we found the ratio of theory and
practice fitting for the topics. However, for some of the more theoretical topics, a
few exercises could have helped to make the information more tangible, for
example, writing a mail to request interviews. Additionally, we noticed some issues
in guiding discussions where it seemed the alumni coaches were missing experi-
ence in such situations. For example, in one coaching session the students were
discussing the same arguments over and over while the coach was just standing
there listening and the advisors had to step in to end the pointless discussion. In a
different session the students seemed very tired and the coach had a hard time
getting them to participate instead of just listening to him. In this session the advisor
did not intervene but noted that providing a set of warm-ups and similar tools would
be helpful for the alumni coaches.
Overall the seminar was perceived positively by teams and coaches. The inputs
provided by the alumni gave students the opportunity to discuss their project and
problems with peers. Additionally, the inputs were perceived as valuable and
relevant as they were mostly aligned with the team’s process and gave firsthand
insights into former projects. The seminar provided the coaches an opportunity to
reflect on their own projects and pass on the knowledge gained therein. Furthermore
the seminar made the coaches feel they are still part of the community even though
they are not part of the design course as such. Thus the concept of the special topic
coaches achieved what we had hoped for in our first implementation.
    The positive feedback encourages us to continue this seminar in the future.
However, the evaluation uncovered some flaws in our execution, so we plan on
some adaptations to the course for the next iteration.
    We noticed a couple of actions we would like to collect as “best practices” for a
rerun of the seminar. These include:
• Taking some time to get to know the team’s current status and problems upfront,
  e.g. through a personal meeting with the team or reading the latest
  documentation.
• Starting with a warm-up if the team seems to be sleepy or distracted.
• Discussing the agenda for the coaching session at the beginning with the team.
• Ending a coaching session with action items for the next coaching or team
  session.
• Incorporate small exercises for theoretical inputs where possible.
  We agree with the coaches’ assessment, expressing their unpreparedness for
some coaching situations, such as long running discussions or dealing with a sleepy
200                                                                                  F. Häger et al.
and distracted team. Therefore we will extend the introduction of coaching guide-
lines, for example by creating a kick-off workshop with exercises to help them.
   Furthermore students and coaches found that some of the topics were coached
too late. In the upcoming project year we will start the coaching seminar in parallel
with the global design course allowing the coaching session to take place earlier
when the teams most need them.
   As discussed in Sect. 5.3.2 the coaches’ rating of the theory to practice ratio
indicates that they had higher expectations towards their coaching sessions, namely
giving more theoretical or practical input than could be achieved in the timeslot. We
plan to address this issue by allowing the coaches to give a more detailed theoretical
input to all teams and following up with a practical coaching session.
   One of the coaches would like to offer a similar seminar for other design courses
at our chair. Our approach has proven valuable to students and coaches and helped
to enhance the global project course with additional coaching sessions. The teach-
ing staff could not have managed so many individual coaching sessions on their
own. Therefore, we firmly believe such a seminar can be valuable for further design
and engineering courses at our or other universities.
   We also believe that the concept of special topic coaches can be applied to
project-based design teams in companies. There it can help to support knowledge
sharing between teams and projects and create a stronger sense of community
between former and current team members.
References
Carleton T, Leifer L (2009) Stanford’s ME310 course as an evolution of engineering design. In:
   Proceedings of the 19th CIRP design conference—competitive design, Cranfield University,
   30–31 Mar 2009
Gerber E, Olson J, Komarek R (2012) Extracurricular design-based learning: preparing students
   for careers in innovation. Int J Eng Educ 28(2):317
Grimheden M, Van der Loos M, Chen H, Cannon D, Leifer L (2006) Culture coaching: a model for
   facilitating globally distributed collaborative work. In: Frontiers in education conference, 36th
   annual, pp 15–20
Hackman J, Wageman R (2005) A theory of team coaching. Acad Manag Rev 30(2):269–287
Hiremath M, Sathiyam V (2013) Fast train to DT: a practical guide to coach design thinking in
   software industry. In: INTERACT (3), pp 780–787
Kim K-J, Bonk C, Teng Y-T, Zeng T, Oh E (2006) Future trends of blended learning in workplace
   learning settings across different cultures. In: 29th annual proceedings, Dallas, TX, vol 1, pp
   176–183
Ledsome C, Dowlen C (2007) Design coaching. Presented at the international conference on
   engineering and product design education, Northumbria University, Newcastle Upon Tyne,
   UK
Meyer J, Harling F (2012) Die harten Nüsse anders knacken: Design Thinking als
   Innovationsmethode für Wissensmanagement. In: KnowTech—Neue Horizonte für das
   Unternehmenswissen—Social Media, Collaboration, Mobility, Germany, pp 71–79
Pretorius CJ, Steyn H (2005) Knowledge management in project environments. S Afr J Bus Manag
   36(3):41–50
Experience and Knowledge Transfer Through Special Topic Coaching Sessions            201
Reich Y, Ullman G, Van der Loos M, Leifer L (2007) Perceptions of coaching in product
   development teams. In: CDROM proceedings of the 16th international conference on engi-
   neering design (ICED), The Design Society
Reich Y, Ullman G, Van der Loos M, Leifer L (2009) Coaching product development teams: a
   conceptual foundation for empirical studies. Res Eng Des 19(4):205–222
Shinoda AM, Maximiano AA, Sbragia R (2013) Knowledge management in project-oriented
   organizations. In: BALAS annual conference, Universidad ESAN, Lima, Peru
Smart Documentation with Tele-Board MED
maps to grasp the central theme in historical developments (d.school 2011, p. 16).
However, there are no mathematical deductions, no formal theories, no category
systems—all in line with the mottos “structure without formalizing” and “focus
without formalizing” (von Thienen et al. 2014b, p. 102). Human stories should
always be tangible, even in the most sharp-witted analyses.
    Traditionally, design thinkers used analogue whiteboards and sticky notes made
out of paper as central documentation tools. However, a digital whiteboard software
application called Tele-Board has been newly created for design thinking project
work (Gumienny et al. 2011, 2012; Gericke et al. 2012). It brings along many
advantages. In the past, teams had to clean their whiteboards regularly to make
room for new pieces of information. Tele-Board, however, saves all stages of
progress on a whiteboard. This way, teams don’t lose their information overview
from an initial exploration phase when they move on to ideation (thinking about
potential solutions), prototyping or testing. Furthermore, when teams work with
analogue whiteboards and sticky notes, most information is thrown away at the end
of projects. Tele-Board, however, saves project documents without consuming
extra space physically—both while projects are running as well as when they are
finished. Finally, analogue sticky notes defy automated search, copying or elec-
tronic posting. Because content is stored digitally in Tele-Board, teams can easily
search for pieces of information automatically, export information to text or picture
files, copy information and send it anywhere in the world they want. Thus, Tele-
Board helps design thinkers enjoy all the advantages of digital work. At the same
time, the system is tailored to create a work feeling that resembles analogue design
thinking work as much as possible, for example a digital pen can be used instead of
having to type. The work flow, hand-movements or eye-contact—everything which
feels good in the process of documenting with analogue tools should feel just as
good when documenting digitally with Tele-Board.
Behaviour psychotherapists share most of the needs that design thinkers have and
they already apply many similar solutions too. However, there are a few
differences.
Touching upon private issues when talking to “strangers” Just like design
thinkers who try to establish and maintain an empathic rapport with interviewees,
psychotherapists seek an empathic rapport with patients. In both cases, the flow of
private information is rather one-directional, i.e. from patient to therapist in the
medical context and from interviewee to interviewer in the context of design
thinking. Furthermore, in both cases the person who is being interviewed typically
has no personal relationship with the interviewer. In this sense it is an information
Smart Documentation with Tele-Board MED                                            207
Seeking experiences away from the whiteboard Just like design thinkers, ther-
apists and patients often spend time away from the whiteboard—in case there is one
in a therapeutic practice. Maybe the patient tries autogenic training. In another
session, the patient might put down stones and flowers across the room to recon-
struct his “life line” including major burdens (stones) or uplifting experiences
(flowers). Or they could work with angst-inducing stimuli to reduce avoidance
behaviour. In every such case the focus is typically on experiences. The “work-
flow” has priority over documentation. Detailed notes regarding such sessions are
unlikely to emerge. Rather, photos or videos can help capture important content
without disturbing the flow.
Analysing by building on psychotherapeutic theories Like design thinkers,
therapists invoke schemes to analyse information. However, the schemes used in
psychotherapy tend to be less playful. They built on psychotherapeutic theories and
are relatively formal.
Ensuring confidentiality of data While design thinkers typically share their
findings with broad audiences, therapists barely communicate with anyone regard-
ing the information they obtain. In behaviour psychotherapy, the protection of
patient data is at least as important as in all other healthcare domains. The
professional code of secrecy states that everybody who works in health services
has to keep strictly confident all information obtained concerning the patient
(Bundesärztekammer 2008a, b; Leiner et al. 2009). This implies that medical
records have to be stored in such a way that they cannot be accessed by third
parties. In practice, the access to paper-based records is physically restricted as they
are stored in a locker. In digital documentation, electronic measures are essential to
protect records from unauthorized access, e.g., through data encryption and pass-
word requests.
    In sum, design thinkers and behaviour psychotherapists share many documen-
tation needs and they already share many solutions too. In particular, documenta-
tion strategies must go hand-in-hand with empathic conversations. There have to be
documentation strategies when the information flow is almost entirely
one-directional (as a patient or user reveals private experiences), when the infor-
mation flow is back and forth (such as when patient and therapist carry out an
analysis together and design thinkers exchange information on a whiteboard) or
when the focus of work is not on documentation at all (because a patient practices
autogenic training, for instance, or design thinkers seek first-hand experiences
regarding their challenge).
    However, there are also three major differences regarding documentation needs
and solutions. In particular, data safety is generally a lot more important in the
psychotherapeutic setting. Different kinds of analysis schemes are used. Finally,
psychotherapists need to generate bureaucratic case reports while design thinkers
do not.
Smart Documentation with Tele-Board MED                                           209
in the new field of application. Features that might more likely cause confusion than
be of assistance have been deactivated in Tele-Board MED.
Changing Tele-Board The graphical user interface had to be redesigned. Tele-
Board MED has to organize patient files whereas Tele-Board organizes design
thinking projects. Thus, the terminology of user menus and help files had to be
adapted.
   While the implementation of rigorous data security standards has been an
important step in the creation of Tele-Board MED (Perlich et al. in press), this
chapter focuses on documentation concerns. The following sections will discuss
how research informed the development of new documentation tools in Tele-Board
MED and how Tele-Board MED supports either behaviour psychotherapists or
design thinkers.
Building on the design thinking approach of using visuals, i.e. small graphical
elements and short verbal notes, a basic structure was developed for Tele-Board
212                                                              J.P.A. von Thienen et al.
MED session protocols. In terms of verbal content, protocols were supposed to rely
solely on utterances made explicitly by the patient or the therapist in a session. Thus,
no delicate thought of the therapist and no abstract reflection on the process (e.g.,
regarding patient resistance) would enter the protocol. Patients could read such
protocols without being surprised by their content; everything should sound familiar
to them.
    This protocol structure was tried out and refined in more than 30 therapy
sessions. A member of the Tele-Board MED team created protocols of therapy
sessions manually based on the suggested protocol structure. These protocols were
then given to the therapists who had led the corresponding therapy sessions to
comment on protocol accuracy and utility. After a couple of iterations the basic
protocol structure did not change much anymore.
    In a feedback study (von Thienen et al. 2014a) the envisioned Tele-Board MED
protocol function including a typical end result, i.e. a sample session protocol, was
shown to 34 behaviour psychotherapists. Among several suggested new Tele-Board
MED features, the therapists considered the protocol function most useful. On a
scale of 1 (marginally useful) to 4 (highly useful) it got an average rating of 3.1.
    In terms of qualitative feedback, the protocol function was generally considered
helpful in remembering prior sessions quickly. However, study participants also
addressed two shortcomings. (1) Session protocols should include some things that
are not explicitly verbalized in therapy sessions like psychotherapeutic interven-
tions/methods used or the process phase. (2) There should be a way to document
things which the patient should not be able to read. In particular, there should be
protocols which document problems of the therapist. Such protocols could help to
prepare for counselling sessions with experienced therapy supervisors.
    Based on this feedback, the general protocol structure was revised once more. It
should be easy to include therapeutic methods and the process phase of therapies in
Tele-Board MED session protocols. At the same time, standard session protocols
should still be easily readable for therapists and patients. No content should appear
in session protocols that is potentially offensive or unforeseeable (“out of control”)
for patients.
    A corresponding protocol function has been implemented in Tele-Board MED.
The system can now generate protocols which include methods used, process
phases and most important work results automatically when therapists follow
some simple conventions in their work routines. As they use digital whiteboards
cooperatively with patients, therapists can place a sticky note with a short descrip-
tion of the therapeutic method being used in the upper left corner of each page (or,
rather, each whiteboard panel). For instance, such a sticky note can say “autogenic
training” or “distinguishing between functional and dysfunctional beliefs”. Many
such sticky notes are already prepared as part of readily available therapy templates
(see Sect. 3). Furthermore, in the lower right corner of each page (or panel) an icon
can be placed to visualize the therapeutic method or the process phase of the
treatment. Finally, therapist and patient can mark sticky notes or pictures as
particularly important to include them straightforwardly in session protocols.
Smart Documentation with Tele-Board MED                                                      213
Fig. 3 Figure illustrates the revised format for automatic protocols, showing an example from one
particular therapy session. The protocol indicates therapeutic process phases, methods used and
most important session content/work results
When doctors work with analogue patient files, these files typically contain sheets
of paper with notes. Tele-Board MED stores patient records digitally. They contain
whiteboard panels (corresponding to paper sheets) with notes, sketches and images.
   In general, therapists can of course oversee the files of all patients they treat.
However, during a therapy session, while a patient is being treated, only the file of
this particular patient should be visible. The patient should not see files of other
patients. Therefore, a new treatment session mode has been implemented in Tele-
Board MED. At the beginning of each session, the therapist selects the pertinent
214                                                                   J.P.A. von Thienen et al.
Fig. 4 A protocol template prepared for therapists to be filled out after treatment sessions—in
particular when unexpected problems have occurred. The information gathered here is not
intended for sharing with patients. Rather it should help in the preparation for supervisions
patient file, presses a “start session” button and all other patient files are hidden. At
the end of each session, the therapist presses the “end session” button and can now
access other patient files again.
    Such a treatment mode is also the basis for the new protocol function: Each
protocol provides an outline of all panels which were opened in a treatment session,
i.e. while the treatment mode was on.
    Figure 5 shows an example of how therapists can use the treatment session mode
and create protocols subsequently.
    In the protocol, all panels that have been opened during a session are listed
chronologically. The panel that was opened first is listed first. Each panel (work
sheet) can then be described and summarised in a number of ways. Users can decide
whether or not to include a picture of the panel. Building on the features of the Tele-
Board whiteboard client (Gericke and Meinel 2012; Gumienny et al. 2011), pinned
or unpinned sticky notes can be included or excluded. Sticky notes can also be
included when they have been marked as “important” by therapist or patient.
    Furthermore, general information on the session can be included. Once again,
users can adjust protocols according to their own preferences. For example, ther-
apists can decide whether or not their protocols should include a reference to the
patient, start and end times of therapies, panel names (corresponding to the head-
lines of paper sheets in analogue documentation) and so forth.
    Technically, a protocol is a normal panel. Therefore, the automatically created
protocol is not carved in stone. Its content can be modified easily by the user.
Smart Documentation with Tele-Board MED                                                             215
Fig. 5 Navigation through the user interface to create a protocol: (1) The therapist clicks on “Start
Session”. A clock starts running, indicating the length of the session. (2) Throughout the session,
therapist and patient can work on one or more panels. In the example shown here, three panels are
opened (however, the opened panels themselves are not shown in this figure). When the “End
Session” button is pressed, a list of all sessions can be viewed that the therapist has had so far with
this patient. (3) The therapist can select one session to open the corresponding protocol. She can
either choose to see all options of automatic protocol creation again (4); in this case, she also sees a
preview of how the protocol will look like depending on the options that she picks. Or (5) she can
216                                                                  J.P.A. von Thienen et al.
In some ways, psychotherapeutic treatment sessions are meetings like many others:
They start with initial hellos and introductory conversation, giving way to the main
part of talking and eventually end with closing statements and saying goodbye.
   There is a metaphor for this structure of typical conversations (maybe more
prominently known as a metaphor in text writing); which is the “sandwich meta-
phor”. We believe this metaphor can also help to describe ways in which Tele-
Board MED protocols support psychotherapists: There is a main conversation part
(“roast piece”) which is embedded in intro- and outro-conversation (“two buns”).
   Almost anything can make a tasty sandwich as long as it is wrapped well in a
bottom and a top bun. For therapy sessions, this means that a therapist will not
launch a behaviour analysis before saying hello first. Rather, a therapist will start by
welcoming the patient, asking how he or she is doing. A connection can be made to
the last session; an overview of the current session can be given and so forth to
create a pleasant intro. Correspondingly, treatment sessions should rather not end
abruptly by simply dismissing the patient when time is over. Rather, summing up
the session, agreeing upon some homework, discussing what one found particularly
important etc. can help to create a smooth outro.
   Tele-Board MED protocols should help prepare salubrious sandwiches. By
building on the last sessions’ protocol, therapists can easily recap where the team
left off when starting a new session. Then, therapist and patient can document their
new conversation digitally with little effort. In the end, automatic protocols can be a
tool to close sessions smoothly. Instead of just sending the patient home, therapist
and patient can look at the protocol jointly, changing it if necessary. The question
“what should be in the protocol?” can serve as a prompt to quickly recapitulate the
session. Therapist and patient can shortly reflect on issues they found most impor-
tant. Afterwards, the therapist might even print out the resulting session protocol for
the patient if he or she wants to take it home.
Fig. 5 (continued) start the protocol immediately—based on default options or previously saved
settings. The protocol opens up as a new whiteboard panel
Smart Documentation with Tele-Board MED                                                        217
automatic protocols was introduced in line with the “sandwich metaphor” (see
Sect. 2.4) and several therapeutic use cases were demonstrated (see Sect. 3).
    The sample of therapists who took part in this study resembles the sample from
our prior test in 2013 (von Thienen et al. 2014a). Two therapists took part in both
studies.
    One aspect of the sample that is particularly important for the interpretation of
results is the attitude towards technology. It is known that people who are
technology-enthusiastic generally assign higher utility ratings to digital tools such
as Tele-Board compared to people who are averse to technology (Chuttur 2009). In
our sample from 2013, the average attitude towards technology was slightly
positive: 0.68 in concrete numbers on a scale from 2 (hostile towards technology)
to 2 (enthusiastic about technology). Using the same scale in 2014, the average
attitude was slightly positive once again: 0.7 in concrete numbers, thus almost
identical to the sample from 2013.
    The average age was somewhat higher in the new study (38.2 years in 2014
versus 35.4 years in 2013). Correspondingly, the new sample group also has longer
work experience. In 2014, 40 % of the therapists had a work experience longer than
2 years—compared to 27 % in 2013.
    Regarding their present documentation habits and tools, most therapists in our
study state they take handwritten notes while treating patients in sessions. The
majority (60 %) writes an average of between ¼ and ½ of a DIN A4 page per
session.
    30 % of the therapists state they usually don’t add any more notes after sessions.
40 % typically add notes of about one quarter of a page, 30 % usually add one half
of a page or more.
    For the time being, both during sessions and afterwards handwriting is the most
common means of documentation. 70 % of the therapists don’t ever use digital tools
for their own case documentation either in or after treatment sessions.
    Taking notes in treatment sessions typically leaves therapists with a somewhat
unstructured session documentation. Most therapists go over their notes once again
and create structured session protocols at least for some treatment sessions (see
Fig. 6).
        5
        4
        3
        2
        1
        0
            almost never up to 10% of        10-50% of       50-90% of       more than
                           sessions           sessions        sessions         90% of
                                                                              sessions
Fig. 6 Therapists answer the question “Have you created ‘orderly’ session protocols in the past, in
addition to ‘spontaneous’ notes taken in or after sessions?”
218                                                                    J.P.A. von Thienen et al.
        6
        5
        4
        3
        2
        1
        0
            up to 5 minutes     5 - 15 minutes    15 - 60 minutes       1 - 2 hours
Fig. 7 Therapists answer the question “How much time do you typically need to write one session
protocol?”
   Figure 7 shows how much time therapists typically spend writing one session
protocol. For the therapists in our sample, one therapeutic treatment session takes
50 min. This means, documenting one therapy session by writing a protocol takes
about half as long as the therapy session itself. Thus, it is a considerable investment
of time on behalf of the therapists.
   We asked the therapists why they write protocols, suggesting the options named
in Table 1. There was also the opportunity to add further purposes in free text fields;
however no further purposes were added. Table 1 shows the hierarchy of reasons for
writing session protocols as specified by the therapists.
   Table 1 shows that adhering to legal documentation demands is quite important
to the therapists; it is the highest ranked purpose. Handing session documentation
out to patients—as though part of new legal demands—is the least important
purpose for writing session protocols. Furthermore, joint documentation with the
patient figures at the second lowest position.
   In line with these findings, 70 % of the therapists state they have never handed
out session protocols to patients. The other therapists state they have given out
protocols to patients already, but just in very few cases. Against this background,
the Tele-Board MED protocol function seems to make a great difference.
Smart Documentation with Tele-Board MED                                                      219
   To start off, all therapists think they would work with session protocols regularly
when supported by the automatic Tele-Board MED protocol function (Fig. 8). This
makes a considerable difference given that so far, without Tele-Board MED, some
therapists don’t work with session protocols on a regular basis.
   Furthermore, therapists are likely to share their session protocols with patients
much more frequently when using Tele-Board MED (see Fig. 9).
   One issue that is often sensitive for automated functionality is the quality of
results. For instance, the program MS Word offers automatic summaries of text
documents. However, these automatic summaries are often cumbersome. They
don’t meet the quality of human-written text summaries.
   In Tele-Board MED, there are some minute actions that therapists can take to
prepare valuable automatic protocols. For instance, it helps to give comprehensible
Fig. 8 The light bars show how often therapists write session protocols nowadays. The dark bars
show estimates of the therapists: how often they would work with session protocols when
supported by the automatic Tele-Board MED protocol function
Fig. 9 Before Tele-Board MED had been introduced, therapists were not in the habit of sharing
session protocols with patients. The light bars show answers to the question: “Have you ever
handed out session protocols to patients?” With Tele-Board MED, therapists are likely to hand out
session protocols to patients. The dark bars show answers to the question: “Do you think you
would hand out Tele-Board MED session protocols to patients?”
220                                                                    J.P.A. von Thienen et al.
Fig. 10 Without Tele-Board MED, therapists use manually-created session protocols for a limited
number of purposes. The light bars show answers to the question: “For what purposes do you
create session protocols?” Answers range from 0 (not so important) to 3 (very important). By
contrast, dark bars show answers to the question: “For what purposes would you use automatically
created Tele-Board MED session protocols?” Once again, answers range from 0 (not so important)
to 3 (very important)
names to new panels or to work with prepared templates. Such minute actions had
been included in the demo of our feedback study. Thus, easy to read protocols were
obtained even though the content was not pre-planned; the therapists could enter
information spontaneously.
   We asked the therapists for the ways in which they would want to use Tele-
Board MED protocols. It was striking that they found more reasons to use such
automatic protocols than they had seen reasons for self-created protocols (Fig. 10).
Thus, automatic Tele-Board MED protocols seem to be at least as serviceable as
self-written protocols: They seem to serve even more purposes.
   Remarkably, the therapists show strong consensus when discussing reasons for
using Tele-Board MED protocols. The highest standard deviation among all seven
purposes (as shown in Fig. 10) is 0.78. By contrast, the therapists seem to have
rather dissimilar reasons for creating session protocols manually. As shown in
Table 1, standard deviations often exceed 1.0 when therapists report their reasons
for (manually) writing protocols.
   With Tele-Board MED, handing out session protocols to patients or
summarising session content cooperatively with the patient become much more
recognized purposes (cf. Fig. 10). The average rating of importance for handing out
session protocols to patients was 0.78 without Tele-Board MED; the average rating
of importance for summarising session content cooperatively with the patient was
1.1. Both numbers move up to 2.11 with Tele-Board MED, on a scale of 0 (not so
important) to 3 (very important).
Smart Documentation with Tele-Board MED                                          221
   Taking into account qualitative feedback as well, we conclude that the protocol
function is strongly appreciated by the therapists. It allows them to create official
session protocols much more frequently than before—given that writing protocols
has been a time-consuming task up to the present. Furthermore, automatic Tele-
Board MED protocols can serve at least as many purposes as present-day, human-
written protocols. Finally, Tele-Board MED helps to meet important goals of
patient empowerment initiatives or patient rights laws. Therapists say they would
share treatment protocols with patients when supported by Tele-Board MED. Thus,
documentation can become much more transparent and cooperative with the new
technical possibilities.
To pay for treatments, health insurance companies typically demand elaborate case
reports. These reports should clarify whether the patient does indeed have a
recognized mental illness, which is usually considered the case if he or she fulfils
a sufficient number of diagnostic criteria specified either in the International
Statistical Classification of Diseases and Related Health Problems (ICD) by the
World Health Organization (e.g., 1992) or the Diagnostic and Statistical Manual of
Mental Disorders (DSM) of the American Psychiatric Association (e.g., 2013).
Furthermore, the proposed treatment needs to be promising; its efficacy should be
scientifically confirmed.
222                                                                  J.P.A. von Thienen et al.
Fig. 11 The template overview shows a screenshot of each template. A verbal description
explains what each template is typically used for. Templates which help to gather or analyse
information are categorised according to the rubrics of case reports, when such information is
requested
Smart Documentation with Tele-Board MED                                              223
Fig. 12 Tele-Board MED makes it easy to gather and sort information. Using the Tele-Board
MED Word Export function, all information can be transferred to a text file immediately
   Using an MS Word export function, information that has been gathered in Tele-
Board MED can be exported to Word easily. Each Word file contains headlines
followed by lists of keywords or short statements (Fig. 12). Thus, to obtain a complete
case report, therapists simply needs to cast the keywords into a running text.
   Compared to documentation procedures formerly used in psychotherapy, there is
(1) no need to search for information in long handwritten files, (2) no need to
digitalize handwritten information, (3) no surprising discovery that one forgot to
ask about some pieces of information that are required in a case report.
   While the templates provided by us are tailored to support behaviour psycho-
therapies, additional templates can be created easily. Thus, psychotherapists who
follow approaches other than behaviour psychotherapy, or medical practitioners of
basically any field, can easily assemble templates which ease their work as well.
The provided templates can also be redesigned, i.e. therapists can easily adjust,
amend or change them completely according to their own preferences.
224                                                                    J.P.A. von Thienen et al.
Behaviour psychotherapy is well equipped with analysis schemes that have proven
helpful in decades of psychotherapy practice and research. The SORKC model
(Kanfer and Saslow 1965), the ABC analysis (Ellis 1973) or plan analysis (Caspar
1986) are just some examples. Tele-Board MED provides several templates which
support popular analyses. For instance, there are short and long versions of the
SORKC model. There are also different types of examples to account for differing
preferences and thinking styles of patients such as examples with animals as
opposed to examples with machines (see Fig. 13).
   Apart from behaviour psychotherapy, design thinkers also use many analysis
schemes that have proven helpful in years of practice. We have adapted several
design thinking based analysis schemes for psychotherapy sessions. In particular,
there are templates which support processes of problem solving (von Thienen and
Meinel 2014, in press).
   Tele-Board MED analysis templates help to address four needs at once. First, the
templates help to introduce, explain and carry out an analysis. Second, the analysis
result is perfectly documented without any extra effort: All content is stored
automatically. It can be opened up and elaborated on in forthcoming sessions.
Third, a session protocol is created automatically which states that this analysis
has been carried out in that session. Fourth, each analysis can easily be sent to a
Word file and included in a case report.
Fig. 13 To explain the SORKC model to patients with differing thinking styles, the therapist can
choose between several analysis templates. Here, two templates for short SORKC analyses are
shown, with animal or machine examples
Smart Documentation with Tele-Board MED                                                    225
Fig. 14 A sample exercise template. It includes a brief tutorial for novices (four blue/dark
instruction cards on the left) plus a short methodological description of the procedure and its
goals (yellow/light method card in the upper left corner) (Color figure online)
226                                                                   J.P.A. von Thienen et al.
Fig. 15 A sample template for psycho-education. Here, a vicious cycle of anxiety is described
Smart Documentation with Tele-Board MED                                                     227
Fig. 16 Session protocols such as the example shown here can be created in a few seconds. Such a
protocol includes the date of the session, start and end time, name of therapist and patient
(or pseudonyms, if preferred), the session subject (such as “life line”) and a picture showing
work results—here the depicted life line of the patient
228                                                              J.P.A. von Thienen et al.
   In other cases, therapist and patient might use the last 5 min of a session to record
the insights they found most striking.
So far, documentation solutions have been sketched out which support behaviour
psychotherapists. However, this chapter will close by returning to the field where
Tele-Board (MED) originates from: design thinking. After all, new features that
have been developed for Tele-Board (MED) can be used by design thinkers too.
   Documentation needs are manifold in the design thinking community and so is
the research regarding new documentation solutions. We will quickly survey
several design thinking projects on documentation and discuss how Tele-Board
(MED) relates to some of their central concerns. In particular, we will outline how
the newly developed automatic protocol function can help to address important
needs that are intensively investigated by the design thinking community at present.
Retrospect understanding Gericke et al. (2012, 2014) want to support design
thinkers and externals who wish to understand creative sessions and their outcomes
in retrospect. This should be achieved by saving enough information while teams
are working so that questions which arise later on can be answered on the basis of
readily available information. By way of contrast, it should not be necessary to
contact design thinkers and interview them to understand past decisions or actions
in their design projects.
   To accomplish this documentation challenge, Gericke et al. (2012, 2014) have
developed the Tele-Board history browser (as described in Sect. 1.1). Tele-Board
stores all actions that teams carry out on digital whiteboards. The history browser
helps to review these activities such as adding sticky notes, moving them around or
deleting them. Furthermore, additional synchronized collaboration data, such as
videos and information regarding the audio level can help to identify or recall
important moments in the work process.
Finding important information artefacts and tracing project paths In the
course of design thinking projects, teams typically accumulate huge amounts of
information artefacts such as sticky notes, whiteboard drawings, pictures, paper
prototypes etc. These huge amounts of information can barely be taken into account
all at once. Both, design teams or externals who try to understand the path of project
progression can experience an information overload.
   To counter such an information overload, Voget (2013) suggests
ConnectingInfos—a software that stores and organizes information artefacts. “It
provides a project overview, where all acquired information is displayed on a time
line. ConnectingInfos supports the ability of seeing a big picture by highlighting
important information and displaying associations between information items.
Users can avoid the danger of information overload by reducing the amount of
Smart Documentation with Tele-Board MED                                             229
Visualising process phases, methods used and most important work results In
a project which is just starting, Giese (2014) and his team investigate “how to
capture and visualize employed DT methodologies at work to understand them in
detail. For example, this includes capturing employed methodology phases,
methods and techniques, as well as artifacts in which the outcome of certain
activities is manifested.”1 Since this project has only just begun, no concrete
documentation solution is suggested yet.
   The new Tele-Board (MED) protocol function addresses several needs that are
currently tackled in design thinking research. In terms of the output, our approach
resembles that of Giese (2014) and his team, as well as that of Voget (2013).
Automated Tele-Board (MED) protocols visualize process phases, methods used
and most important work results—such as sticky notes with insights or pictures of
prototypes (Figs. 3 and 5 showed sample protocols).
   Furthermore, many design thinkers assume that documentation should demand
as little extra work of teams as possible. Otherwise, documentation might not
happen at all. For instance, with respect to educational design thinking settings
Beyhl et al. (2013) remark: “The biggest challenge [. . .] is to get students to
document voluntarily” (p. 4). That is one of the reasons why the authors find that
a workable documentation solution “has to be integrated into the overall process
unobtrusively” (p. 4). For similar reasons, the documentation solution of Gericke
et al. (2012, 2014) does not demand any extra documentation activity of design
thinkers at all.
   The solution suggested by Menning et al. (2014) takes a slightly different turn.
Their approach does call for a considerable amount of documentation effort on
behalf of the teams. Protocols have to be filled out manually on a regular basis.
However, this approach envisions immediate benefits for the documenters. They
become more mindful of the process, of the reasons for design decisions, of
expanding knowledge or competencies.
   Depending on user preferences, Tele-Board (MED) protocols can be both
completely unobtrusive and labour-free or purposefully applied documentation
tools which support reflective team activities. To obtain labour-free protocols,
teams can simply carry out their project work on the digital whiteboards of Tele-
Board. Protocols of work sessions are then created and stored automatically.
However, teams can also work actively with the protocols, e.g. amend or refine
them. Thus, protocols can support reflective team activities in line with the design
thinking motto “be mindful of [the] process” (d.school 2011, p. 0). Accordingly, we
have suggested a usage of Tele-Board (MED) protocols building on the “sandwich
metaphor” in Sect. 2.4: Protocols can help recall earlier works, document current
activities and they can support reflections at the end of a session.
   With some minor adjustments, Tele-Board (MED) protocols could help to
address even more needs. For instance, design teams could be cued to reflect on
1
  https://hpi.de/de/dtrp/projekte/projekte-201415/the-design-thinking-methodology-at-work-cap
turing-and-understanding-the-interplay-of-methods-and-techniques.html
Smart Documentation with Tele-Board MED                                                    231
References
American Psychiatric Association (2013) Diagnostic and statistical manual of mental disorders.
   APA, Washington, DC
Aujoulat I, D’Hoore W, Deccache A (2006) Patient empowerment in theory and practice:
   polysemy or cacophony? Patient Educ Counsel 66(1):13–20
Beyhl T, Berg G, Giese H (2013) Towards documentation support for educational design thinking
   projects. In: Proceedings of the international conference on engineering and product design
   education, pp 408–413
Beyhl T, Giese H (2014) Connecting designing and engineering activities II. In: Plattner H,
   Meinel C, Leifer L (eds) Design thinking research. Building innovators. Springer, Berlin, pp
   211–239
Bundesärztekammer, Kassenärztliche Bundesvereinigung (2008a) Empfehlungen zur ärztlichen
   Schweigepflicht, Datenschutz und Datenverarbeitung in der Arztpraxis. Deutsches Ärzteblatt
   105(19):A 1026–A 1030
Bundesärztekammer, Kassenärztliche Bundesvereinigung (2008b) Technische Anlage zu den
   Empfehlungen zur ärztlichen Schweigepflicht, Datenschutz und Datenverarbeitung in der
   Arztpraxis. Deutsches Ärzteblatt 105(19):1–12
Bundes-Patientenrechtegesetz (2013) Gesetz zur Verbesserung der Rechte von Patientinnen und
   Patienten. Bundesgesetzblatt Teil I Nr. 9 ausgegeben zu Bonn am 25. Februar 2013
Caspar F (1986) Die Plananalyse als Konzept und Methode. Verhaltensmodifikation 4:235–256
Chuttur MY (2009) Overview of the technology acceptance model: origins, developments and
   future directions. Sprouts: working papers on information systems, vol 9, no 37
232                                                                       J.P.A. von Thienen et al.
d.school – Hasso Plattner Institute of Design at Stanford (2011) Bootcamp bootleg. http://dschool.
    stanford.edu/wp-ontent/uploads/2011/03/BootcampBootleg2010v2SLIM.pdf. Accessed 26 Nov
    2014
Ellis A (1973) Humanistic psychotherapy. The rational-emotive approach. McGraw-Hill,
    New York
Gericke L, Gumienny R, Meinel C (2012) Tele-Board: following the traces of your design process
    history. In: Plattner H, Meinel C, Leifer L (eds) Design thinking research. Studying co-creation
    in practice. Springer, Berlin, pp 15–29
Gericke L, Meinel C (2012) Understanding asynchronous design work—segmentation of digital
    whiteboard sessions. In: Proceedings of the IEEE 13th international conference on information
    reuse & integration (IRI), pp 700–705
Gericke L, Wenzel M, Meinel C (2014) Asynchronous understanding of creative sessions using
    archived collaboration artifacts. In: Proceedings of 2014 international conference on collabo-
    ration technologies and systems (CTS), pp 41–48
Giese H (2014) The design thinking methodology at work—capturing and understanding the
    interplay of methods and techniques. Abstract of a new project. https://hpi.de/de/dtrp/
    projekte/projekte-201415/the-design-thinking-methodology-at-work-capturing-and-understanding-
    the-interplay-of-methods-and-techniques.html. Accessed 26 Nov 2014
Gumienny R, Gericke L, Wenzel M, Meinel C (2012) Tele-Board in use: applying a digital
    whiteboard system in different situations and setups. In: Plattner H, Meinel C, Leifer L (eds)
    Design thinking research. Measuring performance in context. Springer, Berlin, pp 109–125
Gumienny R, Meinel C, Gericke L, Quasthoff M, LoBue P, Willems C (2011) Tele-Board:
    enabling efficient collaboration in digital design spaces across time and distance. In:
    Plattner H, Meinel C, Leifer L (eds) Design thinking research. Understand—improve—
    apply. Springer, Berlin, pp 147–164
Kanfer FH, Saslow GH (1965) Behavioral analysis: an alternative to diagnostic classification.
    Arch Gen Psychiatry 12:529–538
Kassenärztliche Bundesvereinigung (1992) Hinweise zum Bericht des Therapeuten und zur
    Begutachtung bei Anträgen auf Verhaltenstherapie. Deutsches Ärzteblatt 89(31/32):A1
    2642–2643
Koch S (2012) Improving quality of life through eHealth—the patient perspective. In: Proceedings
    of the European medical informatics conference (MIE 2012). Quality of life through quality of
    information. Studies in health technology and informatics, vol 180, pp 25–29
Lazarus RS (1999) Stress and emotion. A new synthesis. Free Association, London
Leiner F, Gaus W, Haux R, Knaup-Gregori P, Pfeiffer K-P (2009) Medizinische Dokumentation.
    Lehrbuch und Leitfaden. Schattauer, Stuttgart
Margraf J, Schneider S (1990) Panik. Angstanfälle und ihre Behandlung. Springer, Berlin
Menning A, Beyhl T, Giese H, Weinberg U, Nicolai C (2014) Introducing the Logcal: template-
    based documentation support for educational design thinking projects. In: Proceedings of the
    15th international conference on engineering and production design education, pp 148–154
Miller RH, Sim I (2004) Physicians’ use of electronic medical records: barriers and solutions.
    Health Aff 23(2):116–126
Müther M (2009) Bericht an den (VT) Gutachter. Schneller, leichter, kompetent. Deutsche
    Gesellschaft für Verhaltenstherapie, Tübingen
Perlich A, von Thienen JPA, Meinel C (2014) Supporting talk-based mental health interventions
    with digital whiteboards. In: Proceedings of the European medical informatics conference
    (MIE2014), pp 433–437
Perlich A, Sapegin A, Meinel C (in press) Implementation of data security requirements in a
    web-based application for interactive medical documentation. In: Proceedings of the interna-
    tional conference on health informatics (HEALTHINF)
Plattner H, Meinel C, Weinberg U (2009) Design thinking. Mi-Wirtschaftsbuch, München
Prochaska JO, Velicer WF (1997) The transtheoretical model of health behavior change. Am J
    Health Promot 12(1):38–48
Smart Documentation with Tele-Board MED                                                       233
Schauer M, Neuner F, Elbert T (2011) Narrative exposure therapy (NET). A short-term interven-
   tion for traumatic stress disorders. Hogrefe & Huber, Cambridge
Voget L (2013) An Association-based documentation tool to support design thinking groups in
   using their acquired knowledge. Master thesis, Hasso Plattner Institute at the University of
   Potsdam, Germany
von Thienen JPA, Meinel C (2012) Using Tele-Board to bring (some) design thinking into clinics.
   Part of the series: Tele-Board MED—rethinking patient files. Presentation available online at
   http://www.tele-task.de/de/archive/lecture/overview/7949
von Thienen JPA, Meinel C (2014) A design thinking process to tackle individual life problems
   (created for use in behaviour psychotherapy). Electronic colloquium on design thinking
   research: http://ecdtr.hpi-web.de/report/2014/002
von Thienen JPA, Meinel C (in press) Tele-Board MED: supporting creative problem solving in
   behaviour psychotherapy. In: Corraza GE, Agnoli S (eds) Multidisciplinary contributions to
   the science of creative thinking. Springer
von Thienen JPA, Meinel C, Nicolai C (2014a) How design thinking tools help to solve wicked
   problems. In: Plattner H, Meinel C, Leifer L (eds) Design thinking research. Building innova-
   tion eco-systems. Springer, Cham
von Thienen JPA, Perlich A, Meinel C (2014b) Tele-Board MED: supporting twenty-first-century
   medicine for mutual benefit. In: Plattner H, Meinel C, Leifer L (eds) Design thinking research.
   Building innovators. Springer, Berlin
World Health Organization (1992) The ICD-10 classification of mental and behavioural disorders:
   clinical descriptions and diagnostic guidelines. WHO, Geneva
Young JE, Klosko JS, Weishaar ME (2013) Schema therapy. A practitioner’s guide. Guilford,
   New York
Preserving Access to Previous System States
in the Lively Kernel
1 Introduction
inspect and change the state and behavior of objects at runtime. Programmers create
actual objects, not source code that only abstractly describes potential objects.
   The Lively Kernel was designed to support this kind of development (Lincke
and Hirschfeld 2013). It provides tools to directly manipulate the style, composi-
tion, and scripts of graphical objects. For example, programmers can change the
positions and composition of objects directly using the mouse. They can use
temporary workspaces to manipulate objects programmatically. They can edit and
try methods directly in the context of graphical objects.
   For example, to add new functionality to a graphical application, a Lively Kernel
user might copy an existing button object and then modify the new button object.
This could entail moving the new button to a sensible position, resizing it, setting a
new label, and adding a script to be executed on mouse clicks. The user makes all
changes directly to one button object. How this button fits into the application’s
interface is visible at all times. Clicking the button allows its functionality to be
tested directly. This way, the Lively Kernel makes fast feedback possible, espe-
cially during the development of graphical applications.
   Programmers’ changes to objects can turn out to be inappropriate. Programmers
can, for example, accidentally change positions or connect the wrong objects when
manipulating applications with mouse interactions. They might try a couple of
different alternatives such as different colors and layouts, only to realize that an
earlier state was more appealing. Similarly, programmers might learn in hindsight
that making a change to an object’s scripts introduced an error or impacts the
application’s performance. They might make a mistake in a code snippet, which
then results in the manipulation of many objects. At the same time, it should be
noted that problematic changes can also be introduced when code is evaluated for
the purpose of testing behavior. In this case the intent is not to make permanent
changes to the (current) state (of the program).
   However, when changes turn out to be problematic, programmers often need to
undo them manually. The Lively Kernel does not provide an undo for changes to
objects. This is especially at odds with the Lively Kernel’s support for trying ideas
right away. In other words, developers are able to make changes directly and
receive immediate feedback, but do not get support when such changes turn out
to be inappropriate. Thus, to recover a previous development state, programmers
often need to manually reset the state to how it previously was—probably using the
same tools the changes were initially made with. Furthermore, this potentially
involves multiple properties of multiple objects changed by multiple developer
actions.
   The Lively Kernel provides tools to commit and load versions of objects. In case
such commits exist, programmers can load earlier versions of objects to re-establish
previous states. Nevertheless, depending on how far the latest version is from the
actually desired state, manual changes might still be necessary. To keep the effort to
reestablish any previous state low, programmers would need to commit many
versions. However, this contradicts the goal because committing many versions is
also a significant effort. Some commits would be made only to protect intermediate
states and not to share and document results. Especially when the preserved
Preserving Access to Previous System States in the Lively Kernel                  237
conventional references point to the proxies and the proxies delegate all object
interactions transparently to particular versions of objects. Source transformations
introduce proxies consistently for all objects. Therefore, programmers do not need
to adapt their programs manually.
   This approach supports fine-grained histories of development states. Not every
state can be reestablished—only versions that have been preserved. In this way the
presented solution is a basis for recovery support that continuously preserves
versions.
2 Background
   manipulate properties, like the size and position, or to combine multiple ele-
   ments. This way, programmers always see intermediate states instead of only
   receiving feedback on explicit test runs in between edit-compile-load cycles.
Editing Graphical Objects at Runtime Many prototype-based programming
systems, including the examples given in this section, allow manipulating objects
at runtime. Scratch, Etoys, Fabrik, the Lively Kernel, and Self all provide tools
dedicated to manipulating graphical objects directly. Such graphical objects range
from basic objects like primitive shapes to complete applications like presentation
software or programming tools. Prototype-based programming, programming at
runtime, and direct manipulation of graphical objects seem to be properties that suit
each other.
Similar Objects Without Classes Different prototype-based programming sys-
tems provide different approaches for creating similar objects. Self and JavaScript
incorporate delegation to allow for prototypical inheritance. Objects can inherit
state and behavior directly from other objects: each object has a prototype to which
it delegates whenever looking up a property if the object itself yields no results. In
Self the prototype of an object is set when objects are cloned: The clone’s prototype
is the object it was cloned from.
    In JavaScript objects are created from constructor functions. The constructor
function’s prototype becomes the prototype of created objects. Kevo, in contrast,
does not incorporate this notion of prototypical inheritance. It provides concatena-
tion for incremental modification of objects (Taivalsaari 1995). Objects are copied
to create objects with the same state and behavior as existing objects. These objects
are self-contained. This means, any changes made to an object only affect this
object alone. A particular object can only be changed directly and this cannot be
carried out by changing any other object. To adapt many objects at once, pro-
grammers can use so-called module operations in Kevo. Module operations are
evaluated for groups of objects.
The Lively Kernel is a programming system in the tradition of Smalltalk and Self.
Development happens at runtime. It incorporates tools and techniques that allow it
to be completely self-sufficient. Thus, programmers can create versions of the
Lively Kernel with the Lively Kernel. The Lively Kernel is a browser-based
system. It is implemented in JavaScript and renders to Hyper Text Markup Lan-
guage (HTML).
240                                                                   L. Thamsen et al.
As the Lively Kernel is based in JavaScript, the system and applications are
expressed in a prototype-based, object-oriented language that provides prototypical
inheritance. At the same time, the Lively Kernel also provides a class system and
considerable parts of the system are also expressed using classes.
   The Lively Kernel implements Morphic (Maloney and Smith 1995), a frame-
work for developing graphical applications. The graphical objects of this frame-
work are called mMorphs. Each morph has a class but can also have object-specific
behavior. They can be created by instantiating a class or by copying an existing
morph. Morphs are often edited directly and not through adapting existing classes
or creating new ones. This way, the Lively Kernel mixes the class-based style with
the prototype-based style of object-oriented programming.
   The Lively Kernel’s copy operation does not establish a prototypical inheritance
relationship between the copy and the original. Instead, it creates a full copy of the
original morph’s properties, including its class. Therefore, even though JavaScript
incorporates prototypical inheritance, the Lively Kernel encourages programmers
to use classes to share behavior among objects.
Fig. 2 Three tools of Lively Kernel to manipulate morphs: the inspector, the style editor, and the
object editor
A related tool is the Lively Kernel’s Parts Bin (Lincke et al. 2012), an object repository
to commit and load specific versions of morphs. Morphs saved to the Parts Bin are
called parts to emphasize the ability to reuse any of the morphs in the Parts Bin for other
morphic applications. Figure 3 shows the Parts Bin, opened on the Tools category,
which includes both the Style Editor and the Object Editor. Both these tools are
examples of graphical applications developed from available parts. Their functionality
is expressed in scripts and they are available to users through the Parts Bin.
    The root of the scene graph of visible morphs is called a world. Worlds are not
shared via the Parts Bin, but can be saved as Web pages. A world stores the state of
all visible morphs when saved and that state can be reloaded with the world.
2.3 CoExist
CoExist preserves fast and easy access to previous development states (Krahn
et al. 2009). It is based on the insight that the risk for tedious recovery is caused
by the loss of immediate access to previous development states. With every change,
242                                                                 L. Thamsen et al.
Fig. 3 The Lively Kernel’s parts bin opened on the Tools category
the previous version is lost, unless it has been saved explicitly. This version,
however, can be of value in future development states, when, for example, an
idea turns out inappropriate. For that reason, CoExist creates a new version for
every change to the code base. Users can rapidly switch versions or can access
multiple versions next to each other. CoExist thus gives users the impression that
development versions co-exist. Figure 4 illustrates some of the main user interfaces
concepts of CoExist. It contributes the following concepts and tools:
• Continuous Versioning creates new versions in the background based on the
  structure of programs. It enables programmers to go back to a previous devel-
  opment state and to start over, which will implicitly create a new branch of
  versions.
• User Interface Concepts support browsing and exploring version information as
  well as quickly identifying a version of interest. Two different tools are pro-
  vided. First, the version bar highlights version items that match the currently
  selected source code element. Hovering over items will display additional
  information such as the kind of modification, the affected elements, or the actual
  change performed. Second, the version browser allows for exploring multiple
Preserving Access to Previous System States in the Lively Kernel                            243
Fig. 4 Conceptual figure of CoExist featuring continuous versioning, running tests and recording
the results in the background, and side by side exploring and editing of multiple versions
   With CoExist, programmers can change source code without worrying about the
possibility of making an error. They can rely on tools that will help with whatever
their explorations will turn up. They no longer have to follow certain best practices
in order to keep recovery costs low.
3 Motivation
To exemplify how developers work directly on objects in the Lively Kernel, we will
outline how a Lively Kernel user adds a new feature to the Object Editor.
   The editor has been developed by composing and editing graphical objects.
Thus, the user does not adapt any source code modules to change the editor, but
rather manipulates objects directly.
   In this example, the user adds a magnifier tool to the Object Editor. The
magnifier tool helps find the editor’s target, which is the object the editor currently
presents scripts for. Implementing the new feature requires creating a new button
morph and adding it to the editor, as shown in Fig. 5. The magnifier button has two
features:
1. When a programmer hovers over the button, the Object Editor’s current target is
   highlighted with a rectangular overlay.
2. When a programmer clicks the button, the current target selection is revoked and
   the programmer can select the new target of the editor.
   The following description covers the first of the two features, which is also
shown in Fig. 6 for an Object Editor currently targeting the character of a game.
Manipulating the Button Morph Before implementing the button’s behavior, the
user first creates the button and manipulates its visual appearance. Figure 7 shows
the steps in which the button is manipulated. A basic button, as visible in (1), can be
found in the Parts Bin repository. In (2), the user resizes the button and gives it a
square extent using the Resize halo button. Next, the user loads an image showing a
magnifier icon. Using drag and drop, the image is added to the button in (3).
Dropping one morph onto another connects the two morphs. Moving the button
around will then move the image accordingly. Finally, the user adds the result of
these manipulations, visible in (4), to the Object Editor.
Preserving Access to Previous System States in the Lively Kernel                   245
Fig. 5 The object editor’s magnifier button highlighted with a red outline
Fig. 6 The object editor’s magnifier button as it highlights the editor’s target
   All these changes to the state of the objects are made directly to the objects
themselves: the button morph, the magnifier image morph, and the editor morph.
When programmers edit parts in this way, they often see the effects of their actions
immediately. For example, when adding the new button to the Object Editor, the
button is visible at all times. Programmers do not need to run any code to see and
test the button.
246                                                                    L. Thamsen et al.
Scripting the Button Morph Now it is to the user to manipulate the button’s
behavior. The user adds scripts to the button that lay a translucent rectangle over the
current target. In particular, the button receives two scripts: onMouseMove and
onMouseOut. The implementation of the behavior includes the following:
• The button holds a semitransparent rectangle morph.
• When the mouse enters the button (onMouseMove), the button resizes and adds
  the rectangle to the Lively Kernel world at the position of the target.
• When the mouse leaves the button (onMouseOut), the button removes the
  rectangle from the world again.
   The Lively Kernel’s scripting tools allow evaluating code in the context of their
target objects. Hence, when programmers want to test a script or even just specific
lines of code, they can try the behavior directly for the actual target.
While manipulating objects directly, developers might make changes that they later
want to undo. For example:
• Accidental changes to state: The user could accidentally move a morph and,
  thereby, change a carefully arranged layout. Similarly, a meaningful state can be
  lost when a morph, for example the new button, is accidentally removed from the
  world.
• Inappropriate changes through direct manipulation: The user could make
  changes to the size, position, and colors of morphs to fine-tune the visual
  appearance of the editor’s interface, only to decide later that a particular inter-
  mediate version was more appealing.
• Accidental changes to scripts: The user could introduce a typographical error
  or accidentally remove a script. Moreover, editing a script could introduce a
  defect or a decrease in performance.
• Inappropriate changes through scripts: The user could make a mistake in a
  workspace snippet that is intended to manipulate morph properties programmat-
  ically. Such a snippet can change many properties of many objects.
Preserving Access to Previous System States in the Lively Kernel                   247
4 Object Versioning
An object could represent an address. The state of such an address object could be
as shown in Fig. 10.
   If values are assigned to the city and number fields of the address object, the
object’s state is changed. As the address object’s state is part of the system state,
changing the object’s state changes the system state as well. If we call the initial
state version v1 and the state after making changes to the object version v2, the state
of the address object is different in the two versions of the system, as shown in
Fig. 11.
   To be able to recover previous versions after making changes, the previous states
of objects need to be accessible. For this reason, versions of objects are preserved
and changes are made to new versions of the objects. A version of an object is, in
Preserving Access to Previous System States in the Lively Kernel                  249
v1 v2
              : Address                                         : Address
              street=Kantstr.                                   street=Kantstr.
              number=null                                       number=148
              city=null                                         city=Berlin
v1 v2
              : Address                                         : Address
              street=Kantstr.                                   street=Kantstr.
              number=null                                       number=null
              city=null                                         city=null
                                                                : Address
                                                                street=Kantstr.
                                                                number=148
                                                                city=Berlin
the simplest case, a copy of an object. When the address object is changed in
version v2 of the system, the system does not change the original address object but
rather the copy.
   As shown in Fig. 12, there are now two versions of the address objects in version
v2 of the system. One of the objects holds the original state, while the other holds
the state the object should have in version v2 of the system. The two objects contain
no information indicating to which version of the system they belong. They also do
not store any information showing that one object is a copy of the other.
   At the same time, references to objects remain unchanged. For example, there
could have been a person object referring to the address object. This reference
250                                                                        L. Thamsen et al.
would still be referring to the original address object, even in version v2 of the
system, as shown in Fig. 13. Even after adding values to the fields of the address
object, the following statement would still hold true when aPerson refers to the
person object:
Fig. 14 A version-aware reference relates a person object to two versions of its address property
aCompany.CEO.address.number
   Evaluating the statement in version v1 would return the value null, in version v2
the value 148, and in version v3 the value 112b.
   The information that one version is the predecessor of another version can be
used to resolve to an earlier object version when no current version is available.
This allows for creating new versions of objects only when necessary.
   The version identifier needs to be accessible to the version-aware references.
While it could be available globally, to have a single active version of the system, it
could also be scoped more locally such as thread-local or in the dynamic scope of a
code block. It should, however, not be changed while multiple version-aware
references of an object graph are resolved transitively. Consequently, the version-
aware references involved in evaluating the previous example statement should be
resolved together for the same version identifier.
   To be able to actually reestablish a particular version of the system with our
approach, two requirements need to be fulfilled. First, all mutable objects of the
programming runtime need to be accessed via version-aware references. Second,
the particular version of the system needs to be available. Our approach does not
allow reestablishing every state but specific states that have to be preserved. Pro-
grammers could preserve versions explicitly or the programming system could do
this implicitly. When the programming system automatically preserves versions,
each programmer action could implicitly yield a new version of the system. This
way, programmers could undo and redo the changes of their actions regardless of
whether or not they preserved a version in anticipation of recovery needs.
5 Discussion
   The statement does not include any version information. In particular, it does not
read a specific version from a table of available versions. Instead, the proxies
intercept all object interactions and forward to specific versions transparently.
They fulfill three responsibilities:
1. They know which versions are available for a particular object.
2. They choose a particular version among all available dynamically using context
   information.
3. They forward all interactions transparently to a chosen version.
Fig. 16 Using a proxy as version-aware reference to connect a person object to two versions of an
address object
254                                                                     L. Thamsen et al.
   The proxies in this design are virtual objects (Van Cutsem and Miller 2010).
They do not stand in for specific objects, but can forward intercepted interactions to
any object.
The proxies need to be used consistently for all mutable states. Ordinary references
that usually refer to an object need to refer to the proxy that stands in for the object.
   To use proxies consistently, we create and return proxies for all new objects. All
expressions that create new objects return proxies for those objects instead. This is
achieved by transforming code before it is executed. The source transformations
wrap object literals and constructor functions into proxies. The proxies also always
return proxies as return values. Thus, when proxied constructors are used, the
constructors return proxies for the new objects.
   The proxy encapsulates the reference to the initial version of an object. The
reference to the proxy is passed around instead. For this reason, all references that
would usually point to the same object point to the same proxy. This way, proxies
provide object identity. Checks that would usually compare one object to another
object now compare one proxy to another proxy.
   Only the proxies contain references to the versions of objects. This means that
when the proxies are no longer reachable the versions get garbage collected along
with the proxies. For example, in the code of Listing 4.1 there temporarily exists a
version-aware reference—a proxy—connecting the person object to an address
object. However, before a version of the system is preserved the reference gets
deleted. The address object is not required to reestablish either version 1 or version
2 of the system and nothing prevents the garbage collector from reclaiming the
proxy for the address object with the address object.
delete person.address;
Listing 4.1 A newly created object is not preserved with any version
Preserving Access to Previous System States in the Lively Kernel                     255
Proxies delegate to and create versions of an object using a version of the system. A
version of the system is an object that has a version identifier, a predecessor
version, and a successor version. Figure 17 shows three system versions. In the
example, version v2 is the current version of the system.
    The current version of the system is accessible to the proxies. Proxies use it to
decide to which version of an object they should currently forward. Figure 18 shows
a proxy, versions of an object, and versions of the system. In this example, there are
two object versions that correspond to the two system versions. The current version
of the system is v2 and, therefore, version v2 of the object is the target that the proxy
currently forwards to.
    As long as the system version stays the same, the proxies forward to the same
version of the object. Therefore, an object version is changed only as long as it
matches the current system version. To reestablish the previous version, the system
Fig. 18 A proxy with two object versions in context of the system versions
256                                                                             L. Thamsen et al.
Fig. 19 A new version of an object is created for a new version of the system
version has to be set to its predecessor. In that case, proxies forward interactions to
previous versions of the objects. To preserve the current version, the system version
has to be set to a different version. The proxies forward interactions to other object
versions or, when no such version of the object exists, they create new versions.
   A situation in which a new version of an object is created is shown in Fig. 19. In
a new version v3 of the system, the proxy intercepts a manipulation but has no
object version it can forward to. It therefore copies the most recent version of the
object and forwards manipulation request to this new copy.
   New versions are only necessary when a proxy is about to delegate manipula-
tions. As long as the state of an object is only read, the proxy reports values from a
previous version. The old version of the object still reflects the current state. To
create a new version, a proxy copies the most recent previous version of the object.
5.1.4 Limitations
The current design allows for to preserving and reestablishing versions of the
system. However, without further components these versions only exist in memory
and are not stored to disk. Our current design does not support multiple predeces-
sors or successors. Another limitation of the current design is that the state of
previous versions can be changed. New versions of objects are not affected by
changes to previous versions, but changes to object versions that have not been
copied are present in subsequent versions of the system.
Preserving Access to Previous System States in the Lively Kernel                  257
   In the future, the versioning might allow for branches and merging. This means
that changes to previous states could then be handled in branches that programmers
may or may not merge into future versions.
6 Related Work
6.1.1 CoExist
The approaches presented in this section allow combining changes into first-class
objects and running code with particular sets of changes.
6.2.1 Worlds
Worlds provide a language construct for controlling the scope of side effects:
changes to the state of objects are by default only effective in the world in which
the changes occurred. Worlds are first-class values and can be used to execute
statements with particular side effects being active. A new world can be spawned
from an existing world, which establishes a child-parent relationship between the
two worlds. Developers can commit changes from a child world to its parent world,
thereby extending the scope of the captured side effects. The world approach
includes conditions that prevent commits that would potentially introduce
inconsistencies.
   In comparison, Worlds provides a language construct for experimenting with
different states of the system, while object versioning allows for preserving ver-
sions of the system to recover previous states. Our approach does not include
extensions to the host programming languages and no conditions for combining
versions with their predecessor versions, but instead provides a basis for CoExist-
like continuous versioning and recovery tools.
   Other differences between Worlds and our approach regard implementations.
Our implementation in JavaScript does not prevent garbage collection as Worlds
does. Further, both use different libraries for source transformations. Our source
transformations are faster and do not use JavaScript exceptions.
6.2.4 ChangeBoxes
7 Summary
References
Ingalls D, Kaehler T, Maloney J, Wallace S, Kay A (1997) Back to the future: the story of squeak,
   a practical smalltalk written in itself. In: Proceedings of the 12th ACM SIGPLAN conference
   on object-oriented programming systems, languages and applications. OOPSLA’97. ACM,
   October 1997, pp 318–326
Ingalls D, Palacz K, Uhler S, Taivalsaari A, Mikkonen T (2008) The Lively Kernel—a self-
   supporting system on a web page. In: Self-sustaining systems. S3. Springer, May 2008, pp
   31–50
Kay A (2005) Squeak Etoys authoring and media. Technical report. Published February 2005.
   http://www.vpri.org/pdf/rn2005002_authoring.pdf. Accessed 7 Mar 2014
Krahn R, Ingalls D, Hirschfeld R, Lincke J, Palacz K (2009) Lively wiki: a development
   environment for creating and sharing active web content. In: Proceedings of the 5th Interna-
   tional symposium on wikis and open collaboration. WikiSym’09. ACM, October 2009,
   9:1–9:10
Lewis B (2003) Debugging backwards in time. In: Proceedings of the fifth international workshop
   on automated debugging. AADEBUG’03. Springer, September 2003, pp 225–235
Lieberman H (1986) Using prototypical objects to implement shared behavior in object-oriented
   systems. In: Conference proceedings on object-oriented programming systems, languages and
   applications. OOPLSA’86. ACM, June 1986, pp 214–223
Lienhard A (2008) Dynamic object flow analysis. Ph.D. thesis. University of Bern, December
   2008
Lienhard A, Gı̂rba T, Nierstrasz O (2008) Practical object-oriented back-in-time debugging. In:
   Proceedings of the 22nd European conference on object-oriented programming. ECOOP’08.
   Springer, July 2008, pp 592–615
Lincke J, Hirschfeld R (2012) Scoping changes in self-supporting development environments
   using context-oriented programming. In: Proceedings of the international workshop on
   context-oriented programming. COP’12. ACM, June 2012, 2:1–2:6
Lincke J, Hirschfeld R (2013) User-evolvable tools in the web. In: Proceedings of the 9th
   international symposium on open collaboration. WikiSym’13. ACM, August 2013, 19:1–19:8
Lincke J, Appeltauer M, Steinert B, Hirschfeld R (2011) An open implementation for context-
   oriented layer composition in ContextJS. Sci Comput Program 76(12):1194–1209
Lincke J, Krahn R, Ingalls D, R€ oder M, Hirschfeld R (2012) The lively partsbin—a cloud-based
   repository for collaborative development of active web content. In: Proceedings of the 2012
   45th Hawaii international conference on system sciences. HICSS’12. IEEE, January 2012, pp
   693–701
Maloney JH, Smith RB (1995) Directness and liveness in the morphic user interface construction
   environment. In: Proceedings of the 8th annual ACM symposium on user interface and
   software technology. UIST’95. ACM, December 1995, pp 21–28
Maloney J, Resnick M, Rusk N, Silverman B, Eastmond E (2010) The scratch programming
   language and environment. Trans Comput Educ 10(4):16:1–16:15
Pluquet F, Langerman S, Wuyts R (2009) Executing code in the past: efficient in-memory object
   graph versioning. In: Proceedings of the 24th ACM SIGPLAN conference on object oriented
   programming systems languages and applications. OOPSLA’09. ACM, October 2009, pp
   391–408
Shavit N, Touitou D (1995) Software transactional memory. In: Proceedings of the fourteenth
   annual ACM symposium on principles of distributed computing. PODC’95. ACM, June 1995,
   pp 204–213
Steinert B, Hirschfeld R (2014) How to compare performance in program design activities:
   towards an empirical evaluation of CoExist. In: Design thinking research. Understanding
   innovation. Springer, January 2014, pp 219–238
Steinert B, Cassou D, Hirschfeld R. CoExist: overcoming aversion to change. In: Proceedings of
   the 8th symposium on dynamic languages. DLS’12. ACM, January 2012, pp 107–118
Taivalsaari A (1992) Kevo, a prototype-based object-oriented language based on concatenation
   and modules operations. Technical report LACIR 92-02. University of Victoria
264                                                                            L. Thamsen et al.
Taivalsaari A (1995) Delegation versus concatenation or cloning is inheritance too. In: SIGPLAN
   OOPS messenger 6.3, July 1995, pp 20–49. Issn: 1055-6400
Taivalsaari A (1996) Classes vs. prototypes-some philosophical and historical observations. J
   Object Orient Prog 10(7):44–50
Ungar D, Smith RB (1987) Self: the power of simplicity. In: Conference proceedings on object-
   oriented programming systems, languages and applications. OOPSLA’87. ACM, December
   1987, pp 227–242
Ungar D, Smith RB (2007) Self. In: Proceedings of the third ACM SIGPLAN conference on
   history of programming languages. HOPL III. ACM, San Diego, CA, June 2007, 9:1–9:50
Van Cutsem T, Miller MS (2010) Proxies: design principles for robust object-oriented intercession
   APIs. SIGPLAN Not 45(12):59–72
Warth A (2009) Experimenting with programming language. Ph.D. thesis. University of Califor-
   nia, Los Angeles, December 2009
Warth A, Ohshima Y, Kaehler T, Kay A (2011) Worlds: controlling the scope of side effects. In:
   Proceedings of the 25th European conference on object-oriented programming. ECOOP’11.
   Springer, July 2011, pp 179–203
Connecting Designing and Engineering
Activities III
1 Introduction
  a)                                                              b)
                                                      Potential
                                                      Products
                            Viability                                                      Viability
a a
                                                  b                                                               b
                                    too big                                                         too big
                                to be desirable                                                 to be desirable
 Design
Prototype                                                                                  Feasibility
                            Feasibility
               Product
             Alternatives
Fig. 1 Three dimensions of Design Thinking, cf. Beyhl et al. (2013c), Brown (2009)
artifacts are seldom transcribed for further processing, although these artifacts
embody important design rationales, end-user feedback, and ideas that need to be
retrieved later on, when the innovative idea is made ready for the market. In Design
Thinking, these artifacts are signed entities that need to be associated with artificial
signs to be traceable, because they do not consist of natural signs. Design Thinkers
move these artifacts between different contexts and working states, which are often
captured in terms of photographs as well. These movements constitute a track that
represents the Design Thinker’s journey. Therefore, Design Thinkers are consid-
ered as sign and track makers. Engineers primarily use the created signs and tracks.
Therefore, engineers are considered as primary trace users. They primarily trace in
backward direction from the final documentation to earlier milestones to look up
design rationales and design alternatives. Design Thinkers are secondary trace
users, who use the traces for synthesis and reflection or when a follow-up Design
Thinking project should be kicked off.
    However, as in other disciplines (Arkley and Riddle 2005) traceability is often
considered as beneficial by trace users only and as obstructive by sign and track
makers. We observed that asking Design Thinkers to explicitly leave signs to create
a track for the sole purpose of design documentation (Menning et al. 2014) and later
retrieval does not work well. Thus, a manual retrieval of relevant design artifacts is
quite cumbersome and requires high effort, if possible at all. To ease the retrieval,
the design path including design rationales and design alternatives needs to be
recovered from daily working documents of Design Thinkers without explicitly
asking Design Thinkers to document their progress using a certain documentation
scheme.
    In this chapter, we present a recovery approach (Sect. 2) that enables the
reconstruction of the design path by employing recovery modules (Sect. 3) that
implement arbitrary knowledge extraction procedures and recovery algorithms to
turn implicit knowledge into explicit knowledge. In contrast to this implicit knowl-
edge, engineers and Design Thinkers can query this explicit knowledge. We
evaluate the recovered design path using inventory documentation collected in
educational Design Thinking settings (Sect. 4) and delimit our approach from
related work (Sect. 5) before concluding our approach (Sect. 6).
2 Recovery Approach
Figure 2 gives an overview about our recovery approach. The main idea of our
recovery approach is an active repository (Beyhl et al. 2013a; Beyhl and Giese
2015). An active repository is a repository that is steadily analyzed by recovery
modules. In doing so, the degree of retrievable information stored within the
repository is increased and information that would not be included explicitly in
the repository without analysis can be retrieved easily.
   In Design Thinking, the repository consists of Design Thinking project docu-
mentation, e.g. photographs of whiteboards and text documents that, e.g., describe
268                                                                                                                       T. Beyhl and H. Giese
                                                                                       <<describe>>         <<compose>>
                                                                                                                                           D
                                                                                                 Knowlegde
                                                                                                  Manager
                                                                                                                          D        E   C
                                                                       T1                 B                               Know-             Recovery
                                       DT Ontology                                                                  C     ledge              Model
               Engineer                                                T4                       <<instantiate>>            Flow
  <<answer>>
                                                         T2                       T3
                    <<query>>
                                                                                                                                            <<implement>>
                                                                                                                    A         B        F
                                                                                                 <<instantiate>>
                                      <<query>>                                                                      Recovery Module
                                                                Type
                                                                                          F                                       <<execute>>
                                                                       :T4                    <<generate>>
                                 G    Enriched DT     Type                             Type                                            E        Knowledge
  Query Engine                       Documentation                                                                  Inference Engine             Manager
                                     (Search Index)                  Inferred
                                                                                                <<reuse>>
                                                         :T2                     :T3
                    <<answer>>
                                                                    Knowledge
   <<query>>
                                      <<result>>
                                                                                          A     <<read>>
                                                             Role               Role
                                       DT Project
                                     Documentation
                                                                Models and
                                                               Model Elements
               Design
               Thinker
ontology does not describe how to recover the required knowledge. Therefore,
recovery modules are employed that need to be specified separately to recover the
knowledge. In other words, the recovery modules instantiate the Design Thinking
ontology and, therefore, enable the retrieval of Design Thinking related knowledge.
In this section, we present our Design Thinking ontology and introduce recovery
modules that instantiate the Design Thinking ontology. In the following sections,
we distinguish between different categories of knowledge: meta-data, stakeholder
related knowledge, methodology (process step, activity, technique) related knowl-
edge, and design rationale related knowledge.
more important when recovering the desired knowledge. Figure 3 depicts relevant
meta-data in Design Thinking project documentation that need to be extracted for
further processing. As depicted, each artifact can be labeled with meta-data such as
creation dates, keywords, and versions of an artifact. For example, keywords can be
entities such as stakeholders or process activities. QR codes are digital markers
within photographs that tag them explicitly with pre-defined keywords. Further-
more, extracted creation dates can be used to recover the chronological order of
artifacts.
   The extracted meta-data can be used to recover information about employed
techniques, activities, and process steps. Figure 4 depicts that each artifact in a
Design Thinking project documentation is the outcome of a certain phase. In
Design Thinking, a methodology phase can be a process step, activity, or technique
(design phase for short). A technique is the implementation of an activity. A process
step can consist of multiple activities that are considered as implementation of the
process step. Furthermore, a design phase can be the follow-up design phase of
another design phase. A follow-up phase is different from the design phase before,
e.g. a test phase is the follow-up phase of a prototype phase. Further, we distinguish
follow-up phases into continuations and iterations. In contrast to a follow-up design
phase, a continuation describes the proceeding of a design phase, e.g. the contin-
uation of a test phase. An iteration describes the repetition of a certain design phase,
e.g. 2nd prototype iteration.
   Figure 5 depicts the ontology of design artifacts. These design artifacts are of
special interest, because it is very likely that these artifacts embody the design
rationales that justify next design activities. We distinguish between process arti-
facts, inspirational artifacts, and milestone artifacts. Process artifacts are artifacts
that steer the overall process such as research questions, personas, synthesis doc-
uments, how-might-we questions, selected ideas, and prototypes. Transition
Connecting Designing and Engineering Activities III                                  271
artifacts are special process artifacts that mark the transition from one design phase
to another design phase. Transition artifacts are of special interest, because it is very
likely that they embody design rationales that justify the follow-up design phase by
summarizing the working result of the previous design phase. Milestone artifacts
are artifacts that mark milestones in the overall process such as project reports,
intermediate documentation (e.g. LogCal, Menning et al. 2014), and final docu-
mentation (e.g. LogBook). Milestone artifacts summarize the project progress and
are often the artifacts that are often created at the very end of a certain design path.
Inspirational artifacts are artifacts that are related to the design challenge and
inspire the design team such as souvenirs.
   Figure 6 depicts involved stakeholders. Stakeholders are the design team itself,
teachers in educational settings, project partners, and prospective end users. End
users can be extreme users. The knowledge about stakeholders is important to
differentiate findings and feedback, since it makes a difference whether the feed-
back was given by a prospective end user or a project partner.
272                                                                T. Beyhl and H. Giese
Figure 7 depicts our Design Thinking design path recovery model, which consists
of multiple related recovery modules. In this section, we introduce some of our
Design Thinking recovery modules and outline their purposes. Furthermore, we
describe the interplay of the recovery modules.
    We employ a naming scheme for recovery modules, e.g. R01a. Recovery
modules that recover the same kind of knowledge consist of the same number
(e.g. “01”). Recovery modules that recover the same kind of knowledge by
employing different procedures consist of the same number, but different suffix
letters (e.g. “a”) to be differentiable.
    The recovery modules without ingoing edges are considered as low-level recov-
ery modules. The purpose of low-level recovery modules is to extract relevant
meta-data from artifacts and provide the extracted meta-data to dependent recovery
modules. For example, file names and file headers are used to extract such meta-
data as summarized in Table 1.
    High-level recovery modules use the extracted meta-data. For example, recovery
modules that reconstruct the employed methodology analyze the extracted key-
words for each artifact to infer in which design phase the artifact was created.
Further, recovered knowledge about low-level methodology knowledge such as
employed techniques can be used to recover high-level methodology knowledge
such as employed activities and process steps. Table 2 gives an overview about the
recovery modules for the reconstruction of the employed methodology.
    Further, the recovered knowledge about the employed methodology can be used
to identify process artifacts and milestone artifacts. For example, the artifact created
at the very end of a certain design phase (e.g. point of view) is often a milestone
artifact (e.g. persona), because it summarizes the working state. Table 3 summa-
rizes the recovery modules for the detection of design artifacts.
Connecting Designing and Engineering Activities III                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             273
phases : Phase
folders : Folder
                                                                                                                                                                                                                                                                                                                                                                                                                                   keywords : Keyword
                                                                                                                                                                                                                                                                                                                 processSteps : ProcessStep
                                                                                       activities : Activityorders : CreationOrder       orders : CreationOrder                                processsteps : ProcessStep
                                                                                                                                                   processsteps : ProcessStep
                                                                                                                                                                                                                                              transitions : ActivityTransition
                                         R12ArtifactIteration
                                                                                                            R16aActivityFollowUp                                                                                                                                                                                                                                                                                                                R16bProcessStepFollowUp
                                                                                                                                                               activities : Activity
                                                                                                                                                                                                                                                            activities : Activity
                                                                                                                                                                    R06bRecoverActivityFromTechniques
                                                     versions : ChronologicalVersion                                                                                                                                                                                                                                                                                                                                                                     orders : CreationOrder
R03aChronologicalVersion R01eOrderFoldersByTime
                                                                                                                                                                                                                     R15aRecoverStakeholderFromKeywords
                                                                                                                                                                                                                                                                                        activities : Activity                                                                                                                                                                          creationDates : CreationDateFromName
                                                                                                                                                                                                                                                                                                                       R14aRecoverProcessArtifactFromKeywords                                                                                                                                          folders : Folder
                                                                                                                                                         techniques : Technique
                                                                                                         orders : CreationOrder
                                                                                                                                                                                R05RecoverTechnique                                                                                 R06aRecoverActivity                                                                      R08aRecoverProcessStep                                                                                                                                    versions : FormatVersion
                                                                                                                                                                                                                                                                                                                                                                                              processsteps : ProcessStep     creationDates : CreationDateFromName
                                                                                  R01dOrderFilesByTime
                                                                                                                                                                                                                                               keywords : Keyword                                                                                                                                                                                                                                                       R03bFormatVersion
                                                                                                                                                                                                                                                                                                                  keywords : Keyword                                                                                                                           R01aCreationDateFromName
   Milestone and process artifact knowledge can be used to identify which artifact
embodies knowledge about stakeholders, because for certain kinds of artifacts it is
rather likely that they embody knowledge about stakeholders than for other arti-
facts. For example, a persona embodies knowledge about a fictional stakeholder
that describes the needs of the prospective end-user of an innovation. Table 4
summarizes the recovery modules for recovering stakeholder knowledge.
274                                                                        T. Beyhl and H. Giese
Table 3 Recovery modules for the detection of milestone and process artifact
Recovery
module
no.          Purpose                        Exploited data (depends on)
R10          Recover transition artifact    Recovered activities (cf. R06a and R06b)
             between activities
R11          Recover transition artifact    Recovered process steps (cf. R08a and R08b)
             between process steps
R13     a    Identify milestone artifacts   Extracted keywords (cf. R00) and recovered design
                                            phase knowledge (cf. R06 and R08)
        b                                   Recovered creation order of artifacts (R01d) and
                                            recovered process steps (cf. R08)
R14     a    Identify process artifacts     Extracted keywords (cf. R00)
        b                                   Recovered milestone artifacts (cf. R13)
Table 4 Recovery modules for the identification of artifacts that embody stakeholder knowledge
Recovery
module
no.          Purpose                                          Exploited data (depends on)
R15    a     Identify artifacts that embody stakeholder       Extracted keywords (cf. R00)
       b     knowledge                                        Recovered milestone artifacts
                                                              (cf. R13)
Connecting Designing and Engineering Activities III                                     275
In this section, we describe how our Design Thinking ontology is instantiated with
the help of recovery modules by discussing some recovery module examples.
Recovery modules implement knowledge extraction and recovery algorithms to
reconstruct the knowledge as defined by the Design Thinking ontology. Note, that
different procedures can be employed to recover the same kind of knowledge.
Therefore, multiple recovery modules can exist that recover the same kind of
knowledge, but employ different procedures to do so. However, at least one recover
module per annotation type in the Design Thinking ontology should exist.
   In the following paragraphs, we describe the main ideas how the recovery
modules reconstruct certain kind of knowledge. According to our Design Thinking
ontology, we distinguish between different categories of recovery modules,
e.g. recovery modules that extract meta-data for further processing, recovery
modules that reconstruct the methodology at work such as knowledge about
employed process steps, activities, and techniques, and recovery modules that
provide an approximation which design artifacts contain design rationales and
design alternatives.
   We provide a default pattern language for implementing recovery modules as
depicted by Fig. 8. Solid rectangles denote Design Thinking artifacts of a certain
artifact type. Dashed rounded rectangles denote annotations of a certain annotation
type, i.e. recovered knowledge, that are created when the pattern and additional
constraints are fulfilled. Solid edges denote references between Design Thinking
artifacts. Dashed edges denote roles in which certain artifacts or annotations
participate in an annotation.
Fig. 8 Recovery modules R01b and R01d for the extraction of creation dates from file headers
and the reconstruction of the temporal order of design artifacts
276                                                                 T. Beyhl and H. Giese
employed activities. Figure 11 depicts the recovery module R06b that recovers
activities based on employed techniques. The recovery module takes knowledge
about employed techniques as defined by the input connector techniques and pro-
vides recovered knowledge about activities via the output connector activities. The
function activityFromTechnique() checks which activity is the most likely one for
the technique that was used to create the artifact. In case an activity could be found,
the activity name is assigned to the activity annotation as name attribute. Finally,
the recovered activities are provided via the output connector activities. Analo-
gously, knowledge about employed activities can be exploited to recover employed
process steps, because certain activities are characteristic for certain process steps.
   The recovered activities and process steps can be used to identify artifacts that
constitute the transition between two subsequent activities respectively process steps.
These artifacts are especially important, because it is very likely that they embody
important design rationales, which justify the next activity respectively process steps.
   Figure 12 depicts the recovery module R11 that identifies artifacts, which mark
the transition between two subsequent process steps. The recovery module requires
knowledge about process steps as defined by the input connector processSteps and
provides knowledge about transition artifacts via the output connector transitions.
The implementation of the recovery modules shows that an artifact is considered as
transition artifact when the artifact consists of two process step annotations that
describe different process steps (cf. value comparison of name attributes in terms of
an OCL expression on top of the recovery module). In case the condition is fulfilled,
a transition annotation is created that references both process step annotations
Connecting Designing and Engineering Activities III                                         279
Fig. 11 Recovery module R06b for the recovery of employed activities based on recovered
techniques
Fig. 12 Recovery module R11 for the recovery of artifacts that constitute the transition between
two subsequent process steps
280                                                                T. Beyhl and H. Giese
(cf. previous annotation and next annotation) and the recovered transition artifact.
Analogously, knowledge about artifacts that mark the transition between activities
can be recovered in the same manner.
   Besides artifacts that mark the transition from one design phase to another
design phase, so called milestone artifacts exist that summarize the working state
at several points in time during the design process. These milestone artifacts are
worth to be recovered, because it is very likely that they embody the justification for
taken design decisions and describe the outcome of a design phase. Several
approaches are imaginable how to identify such milestone artifacts within Design
Thinking project documentation. The straightforward approach is to lookup mile-
stone artifacts with the help of common keywords used for naming milestone
artifacts. For example, in educational Design Thinking settings special intermediate
reports and presentations are kindly requested by teachers and project partners.
   For example, the LogCal (Menning et al. 2014) is used for daily documentation,
the LogBook is used for final documentation, and intermediate presentations are
given to project partners and other design teams to share ideas and receive feed-
back. A more sophisticated approach to identify milestone artifacts bases on the
assumption that artifacts created at the very end of a certain design phase are
milestone artifacts. Therefore, knowledge about the chronological order of artifacts
and design phases need to be combined to identify milestone artifacts in Design
Thinking project documentation.
   Figure 13 depicts the recovery module R13b that identifies milestone artifacts
within a certain process step. Since knowledge about process steps and creation
orders is required to recover milestone artifacts, the recovery module consists of
input connectors that consume knowledge about process steps and creation orders.
Further, folders are often used to organize files that belong to the same process step.
We consider the parent folder as container for artifacts that belong to the same
process step. Therefore, the recovery module R13b also consumes knowledge
about folders via the artifact connector folders. As depicted by Fig. 13, the last
created artifact within a process step is the artifact that does not act in a role of a
previous artifact in creation order annotations. Figure 13 depicts this negative
application condition in terms of a crossed out creation order annotation. Finally,
recovered milestone artifacts are provided via the milestones output connector to
dependent recovery modules.
   Process artifacts are artifacts that steer the overall design process. For example, a
persona summarizes the needs of a fictional prospective end-user, while how-to-
questions serve as basis for ideation. Besides exploiting extracted keywords to
identify such process artifacts, recovered milestone artifacts can be exploited to
identify process artifacts. For example, a persona is often the result and last created
artifact of the point of view process step and also an important process artifact,
because it serves as basis to create how-to questions for ideation.
   Figure 14 depicts the recovery module R14b that recovers process artifacts based
on knowledge about milestone artifacts. The implementation of the recovery
module R14b shows that milestone artifacts and their attached phase annotation
are used to map milestone artifacts to process artifacts (cf. OCL expression on top
of the recovery module).
Connecting Designing and Engineering Activities III                    281
Fig. 15 Recovery module R08a and R08b that recover knowledge about employed process steps
based on knowledge about keywords respectively employed activities
4 Evaluation
In this section, we give an evaluation of our Design Thinking recovery model. For
that, we distinguish between a qualitative evaluation (Sect. 4.1) and a quantitative
evaluation (Sect. 4.2). In the qualitative evaluation, we provide recovery examples
and argue why the recovered knowledge must be correct. In the quantitative
Connecting Designing and Engineering Activities III                                    283
evaluation, a student assistant rates the recovered knowledge from his perspective
in a subjective manner. Finally, we discuss the validity of our evaluation (Sect. 4.3).
In this section, we show multiple recovery examples. For that, we took a well-
organized Design Thinking project documentation from an educational Design
Thinking project with a duration of 12 working days and applied our Design
Thinking recovery model.
   In the following figures, solid rectangles denote artifacts such as file and folders.
Solid lines between solid rectangles denote that an artifact contains another artifact.
The filled rhombus denotes the super ordinate artifact. Dashed rounded rectangles
denote annotations, i.e. recovered knowledge. Dashed lines denote roles of artifacts
respectively annotations.
   Figure 16 depicts an example how the storytelling folder was assigned to the
process step point of view. The recovery module R05 recovered that the storytelling
folder is the result of a storytelling activity, because the folder is named “storytell-
ing”. Further, the recovery module R06b recovered that the storytelling folder
belongs to an unpacking activity, because the storytelling technique is often used
to unpack observations and gained insights. Since the storytelling folder contains
photographs with images of clustered observations, it is very likely that the
unpacking activity was inferred correctly. Next, the recovery module R08b recov-
ered that the storytelling folder belongs to the process step point of view, because
the unpacking activity is a common activity in the point of view phase. Since the
Fig. 16 Recovery of process step knowledge based on knowledge about employed activities and
techniques
284                                                                    T. Beyhl and H. Giese
Fig. 17 Recovery of process step transitions and milestones based on process step knowledge
Fig. 18 Recovery of follow-up activities based on knowledge about creation orders of artifacts
and activities
last created artifact in the folder “Synthesis and POV” respectively point of view
process step, the “POV.jpg” artifact is identified as milestone artifact by recovery
module R13b. This piece of recovered knowledge is correct, because the “POV.
jpg” artifact embodies a photograph of a persona, which summarizes the needs of a
fictional person that is used for ideation later on.
    Figure 18 depicts that the unpacking activity is a follow-up activity of the
questioning activity. The recovery module R00a extracted the keywords for the
artifact “questions-kickoff.jpg” and “unpacking-kickoff.jpg”. Next, the recovery
module R06a assigned the artifacts to the questioning activity respectively
unpacking activity. Since the “questions-kickoff.jpg” artifact was created before
the “unpacking-kickoff.jpg” artifact as derived by recovery module R01d, the
questioning activity happened before the unpacking activity. Thus, the unpacking
activity is a follow-up activity of the questioning activity. This piece of knowledge
is correct, because the “unpacking-kickoff.jpg” artifact shows a mind-map that
answers some of the previously stated questions. Furthermore, the “questions-
kickoff.jpg” artifact was identified as process artifact that contains research ques-
tions by recovery module R14a. This piece of knowledge is correct, because the
“questions-kickoff.jpg” artifact embodies a photograph of post-its that state several
questions at the beginning of the project.
                                                               R05   R06a R06b R08a R08b   R10   R11   R13a R13b R14a R14b R15a R15b R16a R16b R17a R17b
                                                                                                  High-Level Recovery Modules
Fig. 19 Arithmetic mean of Likert values and confidence intervals per recovery modules
scale independently from the researchers, who implemented the recovery modules.
We have chosen to employ a one to seven Likert scale. The Likert value of one
denotes a strong disagreement, while the Likert value of seven denotes a strong
agreement.
   For our quantitative evaluation, we took a well-documented Design Thinking
project documentation in terms of a file system structure. The project documenta-
tion consists of overall 166 design artifacts that are stored in a single-level file
system structure. Note, that the student assistant only evaluated created annotations.
Therefore, we are only able to speak about true positives and false positives in the
following paragraphs.
   Figure 19 depicts a bar chart that shows the evaluation results for our high-level
recovery modules. We neglected low-level recovery modules, i.e. recovery mod-
ules that only extract meta-data, in our evaluation, because they just extract meta-
data and, therefore, always yield correct results. Each bar of the bar chart denotes
the arithmetic mean of all Likert values of all annotations created by a certain
recovery module. Furthermore, the confidence interval is depicted as thin line on
top of each bar. Note, that the confidence interval is computed over all created
annotations per module and defines the Likert value range in which 95 % of all
annotations are situated concerning their correctness. As shown by Fig. 18, the
recovery results are in general promising, because the arithmetic mean is in general
above the Likert value of 4.
   Especially, the recovery modules that recover the employed techniques (R05),
activities (R06), and process steps (R08) yield results that are agreed by the student
assistant with a Likert value between 5.03 and 6.11. Based on these results, the
recovery module that detects transition artifacts between activities (R10) yields
positive recovery results with a Likert value of 5.09. The recovery modules that
detect transition artifacts between process steps (R11) yield neutral recovery results
with a Likert value of 4.00. Furthermore, the knowledge about employed process
steps and activities is exploited by the recovery modules R16 and R17 to
Connecting Designing and Engineering Activities III                                287
   Note also, that high-level recovery modules depend on the correctness of the
recovery results of low-level recovery module. Thus, high-level recovery modules,
which depend on low-level recovery modules that perform poorly, also do not yield
good results. These dependencies have to be considered when assessing recovery
modules.
5 Related Work
In the past several attempts have been made to capture Design Thinking artifacts for
documentation purposes. For example, the Tele-Board project (Gericke et al. 2011;
Gumienny et al. 2012) enables global collaboration of design teams by providing
digital whiteboards with an interface similar to analog whiteboards. Further, dif-
ferent software tools have been developed that capture Design Thinking documen-
tation artifacts and support their organization. For example, ConnectingInfos
(Voget 2013) implements an algorithm that rates Design Thinking artifacts
concerning their importance based on ingoing and outgoing connections to other
artifacts. Taking the algorithm implemented within ConnectingInfos, Kroschk
extended this algorithm by distinguishing between importance of artifacts and
relevance of artifacts at certain points in time (Kroschk 2014). Kroschk integrated
his extended algorithm into ProjectZoom (Beyhl et al. 2013c; Beyhl and Giese
2015). ProjectZoom (Beyhl et al. 2013b) is a software tool that enables the
capturing of Design Thinking artifacts from different sources such as online file
storages. Furthermore, ProjectZoom enables the organization of captured Design
Thinking artifacts on a virtual whiteboard with the help of well-known techniques
such as clustering, relations between artifacts, and highlighting of important sec-
tions within artifacts (Koch 2014).
    Design Thinking is a modern form of requirements engineering. While Brown
defines the three dimensions of Design Thinking (Brown 2009), i.e. desirability,
viability, and feasibility, Pohl defines the three dimensions of requirements engi-
neering (Pohl 1994), i.e. specification, agreement, and representation. In software
engineering, traceability is employed to “follow the life of requirements in both a
forwards and backwards direction” (Gotel and Finkelstein 1997). Winkler and von
Pilgrim (2010) give an overview about traceability in requirements engineering.
    Additional research areas, which are related to our approach, are knowledge
discovery and information retrieval. Knowledge discovery deals with detecting new
unknown structures in data sets (Goebel and Gruenwald 1999). Information
retrieval deals with finding information according to a query stated by users
(Mitra and Chaudhuri 2000). By developing recovery modules that instantiate our
Design Thinking ontology, we had to identify common structures that indicate
properties as defined by our Design Thinking ontology and, therefore, we deal with
knowledge discovery. By instantiating the Design Thinking ontology, we enable
information retrieval based on queries stated by engineers and design thinkers. At
Connecting Designing and Engineering Activities III                                           289
the time of writing no recovery approach for Design Thinking exists that is similar
to our approach.
   But, in software engineering related approaches exist that recover software
engineering knowledge such as software design patterns. For example, Niere
et al. (2003) describe an algorithm to recover software design patterns as described
by the Gang of Four (Gamma et al. 1994). While the approach of Niere et al. is
limited to software design patterns recovery, our approach is generic due to its
concept of recovery modules, which enable the implementation of arbitrary and
recovery domain independent recovery algorithms.
Acknowledgement The authors thank Claudia Nicolai (D-School Potsdam) and Axel Menning
(D-School Potsdam) for their input. The authors also thank their student assistants Josephine
Harzmann, Manuel Hegner, Axel Kroschk, and Lukas Pirl for their support. We also give thanks to
our bachelor project team “Intelligent Organization of Design Thinking Artifacts” for their input.
290                                                                       T. Beyhl and H. Giese
References
Arkley P, Riddle S (2005) Overcoming the traceability benefit problem. In: CORD conference
   proceedings, April 2005, pp 385–389
Beyhl T, Giese H (2015) Connecting designing and engineering activities II. In: Plattner H,
   Meinel C, Leifer L (eds) Design thinking research—building innovators. Springer, Cham, pp
   211–239
Beyhl T, Berg G, Giese H (2013a) Connecting designing and engineering activities. In: Plattner H,
   Meinel C, Leifer L (eds) Design thinking research—building innovation eco-systems.
   Springer, Heidelberg
Beyhl T, Berg G, Giese H (2013b) Towards documentation support for educational design
   thinking projects. In: International conference on engineering and product design education,
   September 2013, pp 408–413
Beyhl T, Berg G, Giese H (2013c) Why innovation processes need to support traceability. In:
   Workshop on traceability in emerging forms of software engineering 2013, San Francisco,
   May 2013
Brown T (2009) Change by design. HarperCollins, New York, NY
Gamma E, Helm R, Johnson R, Vlissides J (1994) Design patterns—elements of reusable object-
   oriented software. Addison-Wesley, Reading, MA
Gericke L, Gumienny R, Meinel C (2011) Tele-Board: follow the traces of your design process
   history. In: Design thinking research. Springer, Berlin, July 2011, pp 15–29
Goebel M, Gruenwald L (1999) A survey of data mining and knowledge discovery software tools.
   ACM SIGKDD explorations newsletter
Gotel O, Finkelstein A (1997) Extended requirements traceability: results of an industrial case
   study. In: Proceedings of the third IEEE international symposium on requirements engineering
   1997, pp 169–178
Gotel OCZ, Morris SJ (2011) Out of the labyrinth: leveraging other disciplines for requirements
   traceability. In: 19th IEEE international requirements engineering conference (RE), pp 121–130
Gruber TR (1994) Toward principles for the design of ontologies used for knowledge sharing? Int
   J Hum Comput Stud 43(5):907–928
Gumienny R, Gericke L, Wenzel M, Meinel C (2012) Tele-Board in use: applying a digital
   whiteboard system in different situations and setups. In: Plattner H, Meinel C, Leifer L (eds)
   Design thinking research—measuring performance in context. Springer, Heidelberg, pp 109–125
Koch J (2014) Visualizing and searching of design thinking project documentation. Bachelor
   thesis, Potsdam
Kroschk A (2014) Infering new knowledge into design thinking documentations. Bachelor thesis,
   Potsdam
Menning A, Beyhl T, Giese H, Weinberg U, Nicolai C (2014) Introducing the LogCal: template-
   based documentation support for educational design thinking projects. In: International con-
   ference on engineering and product design education
Mitra M, Chaudhuri BB (2000) Information retrieval from documents: a survey. Inf Retr 2
   (2):141–163
Niere J, Wendehals L, Zündorf A (2003) An interactive and scalable approach to design pattern
   recovery. October 2003
Plattner H, Meinel C, Weinberg U (2009) Design Thinking—Innovation lernen—Ideenwelten
   €offnen. mi-Wirtschaftsbuch, Finanzbuch Verlag GmbH, March 2009
Pohl K (1994) The three dimensions of requirements engineering: a framework and its applica-
    tions. In: Selected papers from the fifth international conference on advanced information
    systems engineering. Pergamon Press, Elmsford, NY, pp 243–258
Voget L (2013) An association-based documentation tool to support design thinking groups in
    using their acquired knowledge. Master thesis, June 2013
Winkler S, von Pilgrim J (2010) A survey of traceability in requirements engineering and model-
    driven development. Softw Syst Model 9(4):529–565