Orange Lab: Lowering Barriers to Data Mining through Embedded Interactive Workflows

2026-06-08Machine Learning

Machine LearningHuman-Computer Interaction
AI summary

The authors created Orange Lab, a web-based tool that lets people build and explore machine learning workflows visually. This system makes it easy to share and embed parts of these workflows into websites, so users can interact with them without needing to know all the technical details. Their main idea, called component exposition, helps hide complexity while keeping everything connected and responsive. They tested this in teaching settings, where it helped students learn about machine learning in a hands-on way. Overall, the authors show that Orange Lab can make data science more accessible and interactive online.

visual programmingmachine learning workflowsinteractive web environmentscomponent expositiondata-driven storytellingvisual analyticsdata literacymodular componentsinteractive interfacescollaborative environment
Authors
Matej Bevec, Aleš Erjavec, Vesna Tanko, Lena Trnovec, Lan Žagar, Ana Farič, Janez Demšar, Blaž Zupan
Abstract
While visual programming of data analysis workflows has become an important vehicle for the democratization of data science, such systems remain largely confined to standalone applications and offer limited support for transitioning their visual analytics solutions into interactive web environments. As a result, data analysis pipelines are difficult to share, embed, and adapt into user-facing analytical tools. We present Orange Lab, a web-based collaborative environment for visual data analytics. At its core, Orange Lab enables users to visually construct machine learning workflows from modular components, where interactions in any component propagate seamlessly through the workflow, turning static pipelines into dynamic, reactive systems that support exploration and data-driven storytelling. Our key contribution is component exposition, a paradigm that allows authors to embed selected workflow components, or parts of their interfaces, into arbitrary web contexts, creating synchronized, interactive interfaces while hiding underlying workflow complexity. This enables the development of tailored analytical views and narrative-driven experiences that integrate data analysis directly into online materials. We demonstrate the approach through deployments in data literacy education, where embedded components guide students in hands-on exploration of machine learning concepts without requiring knowledge of the underlying system, showing that Orange Lab effectively lowers barriers to entry and supports the democratization of data science.