Data Visualization

Data Visualization

Intro

Learn the fundamentals and get hands-on experience with data visualization, including:

  • how to make useful visualizations and dashboards of data,
  • how to help yourself and others to understand and analyse that data,
  • how to use state-of-the-art tools and techniques
read more… less…

Download folder

Ter plaatse

Wanneer?

Van 22/09/2023 t.e.m. 10/11/2023

Waar?

KU Leuven - Brugge

Docenten

Aditya Bhattacharya, Jefrey Lijffijt, Quinten Rosseel, Katrien Verbert

read more… less…

Basisprijs

1800 euro

Open everything Close everything

Target audience

The intended audience consists of professionals, engineers, and researchers that want to learn the fundamentals and get hands-on experience with

  • how to make useful visualizations and dashboards of data,
  • how to help yourself and others to understand and analyse that data,
  • how to use state-of-the-art tools and techniques.

To follow the course, you will need to bring a laptop. The course requires familiarity with programming. In the course Python will be used. It is not necessary to have expertise in Python, but exercises will involve you writing Python code, so if you do not have expertise, you should be willing and able to familiarize yourself with Python syntax. Exercises will be hosted through Google Colab, for which you need a Google account. For Session 4 it is recommended to install Tableau Public on your laptop.

Program

The course consists of three modules of two sessions each. Namely,

  • Data preparation (sessions 1 & 2),
  • Data visualization & interactive graphics (sessions 3 & 4),
  • Dashboards & structured data (sessions 5 & 6).

 

Session 1 (22 September): Data preparation

The basics of data preparation with Python will be covered:

  • Common operations & libraries
    • Challenges in batch data preprocessing
  •  Missing or incomplete records
  •  Outliers or anomalies
  •  Improperly formatted / structured data
  •  Inconsistent values and non-standardized categorical variables
  •  Limited or sparse features / attributes
    •  Functions of Pandas (or Pyspark)
    •  Visualisation of data
  • Best practices for data cleaning & software engineering
    • Readable & documented code
    •  Logging
    • Handling null values
    • Data consistency, standardization & documentation


Session 2 (29 September): Advanced data preparation theory 

In this session, you will cover more advanced topics.

  • Best Practices for Feature Engineering
    • Unstructured data
    • Common operations
  • Data Observability & data quality
    • Data profiling
    • Unit testing
  • Moving to production
    • Docker
    • Workflow orchestrators (e.g., Dagster, Airflow)

 

Session 3 (13 October) and session 4 (20 October): Data visualization, tools & libraries

In these workshop sessions, you will get an introduction to data visualization for data science, analysis, and storytelling. You will get hands-on experience working with popular Python libraries and standard tools on multiple datasets. The module is divided into the following five segments:

  • Introduction to fundamentals of Data Visualization
  • Introduction to Data Visualization using Matplotlib, Pandas, and NumPy in Python
  • Advanced Data Visualization using Seaborn in Python
  • Interactive Data Visualization using Plotly in Python
  • Introduction to Data Reporting using Tableau
     

As a key takeaway from these sessions, along with the theoretical knowledge of data visualization, you will develop hands-on skills working with practical datasets. The code and the datasets will be provided to you and we will use Google Colab to practice the exercises from the first three segments. For the last segment, we recommend you to install Tableau Public.
 

Session 5 (27 October): Dashboards & linked visualizations

This session gives a hands-on introduction to dashboards and linked graphics, using Plotly Dash:

  • Page layout, dash components, custom components,
  • Recap interactive plots,
  • Input, output, callbacks,
    • Make components respond to each other.
  • Examples,
  • Integration of server-side computations and algorithms,
  • Deployment
     

Session 6 (10 November): Interactive graphics to explore complex data

This session covers creating visualizations of structured and complex data, and the integration of these into dashboards:

  • Visualization of (large) time series and collections of time series,
  • (Choropleth) Maps and their complexities, terrain visualization, Sankey diagram/flow map,
  • Visualization using constructed axes: dimensionality reduction/representation learning, visualization of graphs/networks (layout algorithms, chord diagram), image databases.
     

The lecturers

Quinten Rosseel (session 1 & 2) is the technical founder at dotdash.ai (a knowledge graph startup), and has data science & engineering experience across various organizations (Bingli, Unilin, Volvo Group, Tomorrowland, Atlas Copco).

 Katrien Verbert (session 3 & 4) is professor at the Augment research group of KU Leuven. She obtained a doctoral degree in Computer Science in 2008 at KU Leuven, Belgium. She was a postdoctoral researcher of the Research Foundation – Flanders (FWO) at KU Leuven. She was an Assistant Professor at TU Eindhoven, the Netherlands (2013 –2014) and  Vrije Universiteit Brussel, Belgium (2014 – 2015). Her research interests include visualization techniques, recommender systems, explainable AI, and visual analytics. She has been involved in several European and Flemish projects on these topics, including the EU ROLE, STELLAR, STELA, ABLE, LALA, PERSFO, Smart Tags and BigDataGrapes projects.

Aditya Bhattacharya (session 3 & 4) is a doctoral researcher on Explainable AI at the Augment research group of the Department of Computer Sciences, KU Leuven. He has obtained his Master of Science degree in Computer Science with a specialization in Machine Learning from Georgia Institute of Technology, USA. Before joining KU Leuven, Aditya has worked in multiple roles in organizations like Microsoft and Intel and has worked on multiple AI projects in domains related to Computer Vision, Natural Language Processing, Time Series Analysis, Classical Machine Learning, and Data Engineering.

Jefrey Lijffijt (session 5 & 6) is Professor of Data Science, Knowledge Discovery, and Visual Analytics at Ghent University. He has a background in algorithms, statistics, machine learning, and data visualization. He likes making tools that help others to better understand and utilize data.

Practical

Date and location

The training is held at KU Leuven Bruges Campus (Spoorwegstraat 12, 8200 Brugge) on these dates:

  • 22 September
  • 29 September
  • 13 October
  • 20 October
  • 27 October
  • 10 November

We welcome you with coffee from 9.00h. 
The sessions start at 9.30h and last until 13.00h, after which a sandwich lunch is offered.

To enrol

Register online before 15/09/2023. Because of the hands-on character of the training, the number of participants is limited to 15.

The fee for the series is 1800 euro. From the second participant of the same organisation, a discount of 20% is applied.

After registration, you will automatically receive the invoice for your participation.

kmo
Save on your participation costs via the kmo-portfolio. 
Click here for more information. Our approval number is DV.O102270. When submitting your application, you choose the theme Innovation and the Artificial Intelligence advice

Partners

With the support of the Flemish AI Academy

Meer opleidingen op KU Leuven Continue

Wie levenslang wil leren, is bij KU Leuven aan het juiste adres. Het aanbod van KU Leuven Continue stilt je honger naar kennis en kunde: van postgraduaatsopleidingen tot korte trainingen, lezingen, afstandsonderwijs en trajecten die werken en studeren combineren.

Ontdek KU Leuven Continue. Ook alle opleidingen van PUC - KU Leuven Continue vind je daar terug.

Verwante opleidingen

Techniek, engineering en industrie

×

Artificiële intelligentie

×

IT en digitale transformatie

×

keyboard_arrow_up