Training & Related Services

Advanced PyData Courses

Basic Python courses are easy to come by, but when you want to dig deep, the options are few. Building on decades of experience training professionals in academic and industry contexts, Quansight delivers expert custom training on data science, scientific computing, high-performance computing, machine learning, and other essential topics.

Corporate Curriculum Development

If standardized courses aren’t the best option for you, Quansight will design custom courseware (computational notebooks, white papers, videos, and more) to onboard your team efficiently at scale. We’ll break down your technology stack into learning solutions tailored to your needs.

Polars

Polars is a DataFrame library written in Rust for high-speed data processing. This training offers hands-on experience in optimizing data analysis through intuitive APIs, parallel processing, and efficient handling of large datasets. It is ideal for professionals eager to master advanced data science techniques.

Dask

Introduces Dask for scaling data analysis in Python. This workshop covers the technical limits of NumPy and Pandas, the fundamentals of parallel computing in Python, practical use of Dask arrays and dataframes, and an introduction to machine learning with Dask.

RAPIDS

Introduces the open source RAPIDS libraries for accelerating Python computation with GPUs (graphics processing units). Participants practice using the RAPIDS libraries for common ETL and machine learning workloads without having to program in low-level languages (e.g., C/C++).

Computer Vision

Prepares practitioners to tackle the automated analysis and interpretation of images with practical computer vision systems. This includes the rudiments of computer vision theory and methods (e.g., feature extraction, object recognition, registration, segmentation).

Xarray

Introduces participants to the Xarray project for manipulating multi-channel data (e.g., as commonly occurs in the geosciences). Participants practice using Xarray for data analysis, extending techniques from Pandas and NumPy to high-dimensional labeled arrays.

Dask-ML

Introduces participants to Dask-ML for scaling standard Python machine learning tools (e.g., scikit-learn, XGBoost). Participants apply various pre-built models on moderate-to-large datasets to learn best practices for parallel and out-of-core machine learning.

Deep Learning

Introduces participants to the foundations of deep learning. Participants practice constructing neural networks of various levels of complexity to connect the core ideas to their realization in practical applications (e.g., image processing, natural language processing).

Visualization

Teaches techniques for web-based data exploration and interactive app development in Python using the open source HoloViz libraries (e.g., HoloViews, hvPlot, Datashader, and Panel). These tools make it easy to construct rich, high-performance, scalable, flexible, and deployable visualizations.

Numba

Introduces participants to Numba, a tool for just-in-time (JIT) compilation of Python code. Participants practice profiling sample application code and accelerating it with Numba.

Intake

Provides an overview of Intake, a lightweight package for finding, investigating, loading, and disseminating data. Participants will learn the fundamentals of using Intake to deploy data as would be required in applied industrial settings under various constraints (e.g., hardware, security).

Pangeo

Introduces Pangeo as a community platform for big data geoscience. Participants will experiment with large geophysical datasets and tools like Dask and Xarray to familiarize themselves with the Pangeo stack.

Advanced Python

Provides a deep dive into internal features of the Python programming language as related to asynchronous computation, concurrency, efficiency, functional programming, and object-oriented design.

Nebari

Introduces participants to the administrative tools and configuration options for Nebari, an open source data science platform developed for collaboration and scalability. Participants will learn how to create, reconfigure, and manage the robust infrastructure that can be deployed rapidly using Nebari. This training package includes designated office hours for questions.

Ready to level up your PyData training?