Skip to content

A curated collection of high-quality AI implementations developed by researchers and engineers at the Vector Institute

Browse Implementations View on GitHub
73
Implementations
7
Years of Research

Browse Implementations by Type

atomgen

2024 applied-research

Library for handling atomistic graph datasets focusing on transformer-based implementations, with utilities for training various models, experimenting with different pre-training tasks, and a suite of pre-trained models with huggingface integrations

AtomFormer SchNet TokenGT

kg-rag

2025 applied-research

A comprehensive framework for Knowledge Graph Retrieval Augmented Generation (KG-RAG).

Datasets: SEC 10-Q

pmc-data-extraction

2024 applied-research

A toolkit to download, augment, and benchmark Open-PMC data

bias-mitigation-unlearning

2024 applied-research

A repository for social bias mitigation in LLMs using machine unlearning

anomaly-detection

2023 bootcamp

A repository with implementation of anomaly detection techniques

Logistic Regression (Supervised) Random Forest (Supervised) XGBoost (Supervised) CatBoost (Supervised) Light GBM (Supervised) TabNet (Supervised and Semi-supervised) Autoencoder (AE) (Unsupervised) Isolation Forest (Unsupervised)

recommender-systems

2022 bootcamp

A repository with implementations of recommender systems

Matrix Factorization Collaborative Filtering Content-Based Filtering Sequence Aware Recommender Systems Session-Based Recommender Systems Knowledge Graph-Based Recommender Systems

A repository with implementations of privacy-enhancing techniques for machine learning

Differential Privacy (tensorflow_privacy) PATE Membership Inference Attacks Horizontal Federated Learning Vertical Federated Learning Homomorphic Encryption

self-supervised-learning

2024 bootcamp

A repository with reference implementations of self-supervised learning techniques

diffusion-models

2024 bootcamp

A repository with demos for various diffusion models for tabular and time series data

TabDDPM TabSyn ClavaDDPM CSDI TSDiff

ai-deployment

2024 bootcamp

A repository with reference implementations for deploying AI models in production environments, focusing on best practices and cloud-native solutions.

A repository reference implementations for retrieval-augmented generation

Web Search Document Search SQL Search Cloud Search PubMed QA RAG Evaluation

finetuning-and-alignment

2024 bootcamp

A repository with implementations advanced fine-tuning techniques and approaches to enhance Large Language Model performance, reduce their computational cost, with a focus on alignment with human values

FSDP DDP Instruction Tuning PEFT Quantization Supervised Fine-tuning

fed-rag

2025 tool

A framework for fine-tuning retrieval-augmented generation (RAG) systems.

Basic fine-tuning with FL RA-DIT

vector-inference

2024 tool

Efficient LLM inference on Slurm clusters using vLLM.

mmlearn

2024 tool

A toolkit for research on multimodal representation learning

Contrastive Pretraining I-JEPA

fl4health

2024 tool

A flexible, modular, and easy to use library to facilitate federated learning research and development in healthcare settings