Andrew Burks

Building systems that help people make sense of data.

Andrew Burks

Principal Research Scientist & Director of Decision Sciences at Epsilon. Ph.D. Candidate in Computer Science at the University of Illinois at Chicago.

Building DiME, a visual analytics platform that synthesizes proprietary identity data and client datasets to surface customer insights. Leading product architecture, UI/UX design, and Ada, the natural language interface that scaffolds analysis and answers questions directly.

milestone
Promoted to Principal Research Scientist | Director, Decision Sciences

Promoted to Principal Research Scientist and appointed Director of Decision Sciences Visual Analytics at Epsilon.

milestone
Promoted to Sr. Staff Research Scientist

Promoted to Senior Staff Research Scientist in the Decision Sciences Visual Analytics group at Epsilon.

milestone
Full-time at Epsilon as Sr. Visual Analytics Engineer

transitioned from intern to full-time Senior Visual Analytics Engineer at Epsilon.

publication

Securing Collaborative Work in Wide-band Display Environments

publication

Traces of Time through Space: Advantages of Creating Complex Canvases in Collaborative Meetings

project

Alveolus Analysis: A Web-based Tool to Analyze Lung Intravital Microscopy

Acute lung injury and the acute respiratory distress syndrome (ARDS) are characterized by pulmonary inflammation, reduced endothelial barrier integrity and filling of the alveolar space with protein rich edema fluid and infiltrating leukocytes. Animal models are critical to uncovering the pathologic mechanisms of this devastating syndrome. Intravital imaging of the intact lung via two photon intravital microscopy has proven a valuable method to investigate lung injury in small rodent models through characterization of inflammatory cells and vascular changes in real time. However, respiratory motion complicates the analysis of these time series images and requires selective data extraction to stabilize the image. Consequently, analysis of individual alveoli may not provide a complete picture of the integrated mechanical, vascular and inflammatory processes occurring simultaneously in the intact lung. To address these challenges, we developed a web-based visualization application named Alveolus Analysis to process, analyze and graphically display intravital lung microscopy data. RESULTS: The designed tool takes raw temporal image data as input, performs image preprocessing and feature extraction offline, and visualizes the extracted information in a web-based interface. The interface allows users to explore multiple experiments in three panels corresponding to different levels of detail: summary statistics of alveolar/neutrophil behavior, characterization of alveolar dynamics including lung edema and inflammatory cells at specific time points, and cross-experiment analysis. We performed a case study on the utility of the visualization with two domain experts and they found the tool useful because of its ability to preprocess data consistently and visualize information in a digestible and informative format. CONCLUSIONS: Application of our software tool, Alveolus Analysis, to intravital lung microscopy data has the potential to enhance the information gained from these experiments and provide new insights into the pathologic mechanisms of inflammatory lung injury.

essay

Andrew's React Basics

A mental model for React that scales to large applications like SAGE3. Covers foundational definitions of overloaded terms like 'state' and 'instance' as used in this context.

note

Slate Gun Deaths Visualization

project

VisSnippets: A Web-based System for Impromptu Collaborative Data Exploration on Large Displays

The VisSnippets system is designed to facilitate effective collaborative data exploration. VisSnippets leverages SAGE2 middleware that enables users to manage the display of digital media content on large displays, thereby providing collaborators with a high-resolution common workspace. Based in JavaScript, VisSnippets provides users with the flexibility to implement and/or select visualization packages and to quickly access data in the cloud. By simplifying the development process, VisSnippets removes the need to scaffold and integrate interactive visualization applications by hand. Users write reusable blocks of code called 'snippets' for data retrieval, transformation, and visualization. By composing dataflows from the group's collective snippet pool, users can quickly execute and explore complementary or contrasting analyses. By giving users the ability to explore alternative scenarios, VisSnippets facilitates parallel work for collaborative data exploration leveraging large-scale displays. We describe the system, its design and implementation, and showcase its flexibility through two example applications.

publication

Vissnippets: A Web-based System for Impromptu Collaborative Data Exploration on Large Displays

essay

Mixed-Platform Visual Data Exploration using VisSnippets

A Unity VR application that extends the VisSnippets collaborative visual data exploration system to immersive 3D environments. Demonstrates declarative data operations (VegaLite charts, external selections) that bridge a Node.js analysis backend to a Unity VR client via REST API.

note

CS528 Project 2 Proposal

essay

CS528 Project 1

A Unity VR application set in the UIC Grove that demonstrates two explicit interactions: controlling a grill and a sprinkler system. The environment features animated vehicles, day/night cycle with calculated sun positions, and spatialized audio throughout.

project

SAGE2 + JupyterLab Integration

To better integrate existing data science workflows into the SAGE2 collaborative experience, we provide a SAGE2 plugin for JupyterLab. The plugin supports sharing both static notebooks, as well as dynamically updating cell content to SAGE2. A user simply connects to one or more SAGE2 server by URL, and can start sharing their Jupyter Notebook content. Shared notebooks are rendered by Jupyter nbviewer as static web-pages on the SAGE2 wall. Shared notebook cell image output is sent as an image to SAGE2 and dynamically updates every time that a cell is run.

publication

Usage Patterns of Wideband Display Environments in E-science Research, Development and Training

milestone
Hired at Epsilon as Data Visualization Intern

Joined Epsilon as a data visualization intern.

project

Details-First, Show Context, Overview Last: Supporting Exploration of Viscous Fingers in Large-Scale Ensemble Simulations

Visualization research often seeks designs that first establish an overview of the data, in accordance to the information seeking mantra: “Overview first, zoom and filter, then details on demand”. However, in computational fluid dynamics (CFD), as well as in other domains, there are many situations where such a spatial overview is not relevant or practical for users, for example when the experts already have a good mental overview of the data, or when an analysis of a large overall structure may not be related to the specific, information-driven tasks of users. Using scientific workflow theory and, as a vehicle, the problem of viscous finger evolution, we advocate an alternative model that allows domain experts to explore features of interest first, then explore the context around those features, and finally move to a potentially unfamiliar summarization overview. In a model instantiation, we show how a computational back-end can identify and track over time low-level, small features, then be used to filter the context of those features while controlling the complexity of the visualization, and finally to summarize and compare simulations. We demonstrate the effectiveness of this approach with an online web-based exploration of a total volume of data approaching half a billion seven-dimensional data points, and report supportive feedback provided by domain experts with respect to both the instantiation and the theoretical model.

project

Precision Risk Analysis of Cancer Therapy with Interactive Nomograms and Survival Plots

We present the design and evaluation of an integrated problem solving environment for cancer therapy analysis. The environment intertwines a statistical martingale model and a K Nearest Neighbor approach with visual encodings, including novel interactive nomograms, in order to compute and explain a patient's probability of survival as a function of similar patient results. A coordinated views paradigm enables exploration of the multivariate, heterogeneous and few-valued data from a large head and neck cancer repository. A visual scaffolding approach further enables users to build from familiar representations to unfamiliar ones.

milestone
United Airlines Internship Complete

Finished data visualization internship at United Airlines.

project

SAGE2 Partitions

While SAGE2 allows users the complete freedom to organize their content, SAGE2 Partitions aim to provide optional structure to the organization. SAGE2 partitions can act as divisions of the screen space, deforming their neighbors when resized. The Partitions can be 'un-snapped' to allow for free movement, becoming a simple grouping of applications. Partitions can be 'tiled' organizing the content within into a grid. As content is added or removed from a Partiton, the groupings are dynamically updated and content dynamically rearranged. Click-and-drag interaction can be used both to create new Partitions in an empty space and essentially select apps to grab, or to 'cut' existing Partitions to divide the size and content.

milestone
Data Visualization Internship at United Airlines

Started data visualization internship at United Airlines.

milestone
Started PhD at EVL/UIC

Began PhD program at the Electronic Visualization Laboratory, University of Illinois at Chicago.

publication

Details-first, Show Context, Overview Last: Supporting Exploration of Viscous Fingers in Large-scale Ensemble Simulations

publication

Precision Risk Analysis of Cancer Therapy with Interactive Nomograms and Survival Plots

milestone
Graduated Bachelors of Science (BS) Computer Science from UIC

Earned BS in Computer Science from the University of Illinois at Chicago.

publication

MC2: Mining Factory Pollution Data through a Spatial-Nonspatial Flow Approach

publication

MC1: A Bespoke Analysis Tool for Spatio-temporal Park Traffic Data

publication

MC3: A Web-Based Interactive Image Explorer for Temporal Analysis of Satellite Images

project

Visualizing ensemble time-evolving probability landscapes of stochastic networks

The computational study of the dynamic and stochastic natures of gene regulatory networks is a challenging topic in systems biology. Visualizing ensemble time-evolving probability landscapes of stochastic gene networks can further biologists’ understanding of phenotypic behavior associated with specific genes. We present a web-based visual analysis tool for the exploration of peak distributions over state space and simulation time in such stochastic networks, and the comparison of peak distributions between multiple simulations. Our approach combines multiple linked views to capture ensemble time-evolving probability landscapes. A peak trajectory cube provides users an overview of peak spatiotemporal distributions between six simulations. A peak projection map shows the exact peak locations of multiple simulations at the user selected time. At a more detailed level, users can inspect a particular state in the peak projection map to view for each simulation both the probability values over time, and the local probability landscape shapes. This information is displayed in a small multiple using two glyphs: profile glyphs and arrow glyphs. The arrow glyph indicates that a state is a peak when all the glyph eight arrows point towards the glyph center. In the figure, a disagreement between the arrow glyphs and the peak projection map demonstrates that probability distributions over genes in this system are not independent of each other. Our visual analysis tool allows bioinformatics researchers to explore and compare the time evolving changes of probability landscapes from multiple simulations efficiently, without running many small scripts and computing all characteristics separately.

project

Context-Aware Visualization of Englewood Social Services

The Englewood Data Hub (EDH) is a project that brings together both crowdsourced and public data surrounding the Englewood area into a single place.<br>The EDH Resource Directory makes use of the crowdsourced service and school data to display an interactive map with their locations. Its user interface (UI) allows for an exploratory experience that offers filtering of services by names and type and displays detailed information about the services available in Englewood. The EDH Analytics Tool combines the crowdsourced service and school data with public data sets to display visualizations; specifically, the public datasets used are 2010 Census data along with Vacant Lot and Crime data from the Chicago Data Portal. The Englewood Data Hub has been developed by a group of researchers at the Electronic Visualization Laboratory, University of Illinois at Chicago under the direction of Prof. G. Elisabeta Marai. Team members include EVL research experience undergraduate students (REUs): Andrew Burks, Joshua Castor, and Isabel Lindmae. This project is funded by the The Joseph and Bessie Feinberg Foundation and the University of Illinois at Chicago.

project

Dynamic Influence Networks for Rule-Based Models

We introduce the Dynamic Influence Network (DIN), a novel visual analytics technique for representing and analyzing rule-based models of protein-protein interaction networks. Rule-based modeling has proved instrumental in developing biological models that are concise, comprehensible, easily extensible, and that mitigate the combinatorial complexity of multi-state and multi-component biological molecules. Our technique visualizes the dynamics of these rules as they evolve over time. Using the data produced by KaSim, an open source stochastic simulator of rule-based models written in the Kappa language, DINs provide a node-link diagram that represents the influence that each rule has on the other rules. That is, rather than representing individual biological components or types, we instead represent the rules about them (as nodes) and the current influence of these rules (as links). Using our interactive DIN-Viz software tool, researchers are able to query this dynamic network to find meaningful patterns about biological processes, and to identify salient aspects of complex rule-based models. To evaluate the effectiveness of our approach, we investigate a simulation of a circadian clock model that illustrates the oscillatory behavior of the KaiC protein phosphorylation cycle.

publication

Dynamic Influence Networks for Rule-based Models