Skip to content

Tools catalog

Your full AI engineering stack.

81+ production tools — every layer of the stack, taught across the six schools. Filter by school, category, or how you use the tool.

81 of 81

Catalog

Every tool Safua learners touch.

Filters81 tools

By school

By category

By depth

  • Python

    Used

    The default language of modern AI engineering. Type hints, async, strong stdlib.

    DataMLAIAgentsMLOpsSafety
  • TypeScript

    Used

    Typed JavaScript for AI-app front ends and Node services that call AI APIs.

    AIAgents
  • Rust

    Used

    Systems-grade language for latency-sensitive inference and control planes.

    MLOps
  • Go

    Used

    Concise, concurrent language powering a large share of cloud-native infrastructure.

    MLOpsAI
  • SQL

    Used

    The lingua franca of data. Windowing, joins, query plans — well beyond SELECT *.

    DataAIML
  • PostgreSQL

    Used

    The reliable default for transactional data. Strong ecosystem, battle-tested.

    DataAIMLOps
  • Redis

    Used

    In-memory key-value + caching. The reflexive choice for hot paths and queues.

    DataAIMLOps
  • MySQL

    Used

    Classic relational engine still running a sizeable fraction of the internet.

    Data
  • MongoDB

    Used

    Document database. Used where schemas are fluid and nested documents are the unit.

    AI
  • SQLite

    Used

    Embedded SQL engine. Perfect for local development and eval harnesses.

    Data
  • Cassandra

    Used

    Wide-column distributed database for write-heavy, globally replicated workloads.

    Data
  • Elasticsearch

    Used

    Full-text search + analytics engine. Supports hybrid text + vector retrieval.

    DataAI
  • Snowflake

    Used

    Cloud data warehouse separating compute from storage for elastic analytics.

    Data
  • DuckDB

    Used

    In-process analytical database. Run warehouse-grade queries inside a notebook.

    DataML
  • ClickHouse

    Used

    Columnar analytical database built for sub-second queries over huge tables.

    Data
  • MinIO

    UsedProvisioned

    S3-compatible object storage you can run on your own hardware.

    MLOpsData
  • pgvector

    Used

    Vector search inside Postgres. One database, embedding-aware retrieval.

    AIAgents
  • Qdrant

    Used

    Purpose-built vector database with filtering and payload storage.

    AIAgents
  • Weaviate

    Used

    Vector database with built-in reranking and multi-tenant retrieval.

    AIAgents
  • Chroma

    Used

    Developer-friendly embedding store that runs in-process or as a service.

    AIAgents
  • Milvus

    Used

    Distributed vector database designed for billion-scale similarity search.

    AI
  • Airflow

    Used

    Author, schedule, and monitor batch data pipelines as code.

    DataMLOps
  • Kafka

    Used

    Distributed log and streaming platform. The backbone of event-driven systems.

    DataMLOps
  • Spark

    Used

    Distributed compute engine for big-data batch and streaming workloads.

    DataML
  • dbt

    Used

    Transform data in the warehouse with versioned, tested SQL models.

    Data
  • Prefect

    Used

    Python-native workflow orchestration with strong retry + observability defaults.

    DataMLOps
  • Temporal

    Used

    Durable execution for long-running workflows, agent state, and human-in-the-loop.

    MLOpsAI
  • Flink

    Used

    Distributed stream-processing engine for stateful, low-latency streaming.

    Data
  • PyTorch

    Used

    Deep-learning framework with dynamic graphs. The research default turned production.

    MLAISafety
  • TensorFlow

    Used

    Deep-learning framework with a strong production-serving story.

    ML
  • scikit-learn

    Used

    Classical ML: linear models, trees, SVMs, calibration — the rigorous baseline.

    MLData
  • Hugging Face

    Used

    Transformers library + model hub. Standard for modern NLP + multimodal.

    MLAIAgents
  • pandas

    Used

    DataFrames for data manipulation and exploration in Python.

    DataML
  • NumPy

    Used

    N-dimensional arrays and the math that every ML library depends on.

    MLData
  • XGBoost

    Used

    Gradient boosting library. Still the thing that wins most tabular competitions.

    MLData
  • LightGBM

    Used

    Fast gradient-boosting framework tuned for large feature spaces.

    ML
  • JAX

    Used

    Composable transformations of numerical code. Auto-diff + XLA for scale.

    ML
  • PyTorch Lightning

    Used

    High-level wrapper around PyTorch that removes training-loop boilerplate.

    ML
  • Optuna

    Used

    Hyperparameter optimisation framework with pruning and parallel trials.

    ML
  • Keras

    Used

    High-level deep-learning API, now unified across TensorFlow, PyTorch, and JAX.

    ML
  • Jupyter

    Used

    Notebook environment for exploration, teaching, and reproducible reports.

    MLDataSafety
  • Streamlit

    Used

    Turn a Python script into a demo UI in minutes. Great for internal tools.

    MLAI
  • MLflow

    Used

    Experiment tracking, model registry, and deployment tooling — open-source.

    MLMLOps
  • Weights & Biases

    Used

    Experiment tracking + model registry with collaborative dashboards.

    MLMLOps
  • TensorBoard

    Used

    Training-run visualisation for losses, gradients, embeddings, and images.

    ML
  • Docker

    UsedProvisioned

    Container runtime for reproducible application packaging and local dev.

    MLOpsAIData
  • Kubernetes

    UsedProvisioned

    Container orchestration for scheduling, scaling, and networking services.

    MLOps
  • GitHub Actions

    Used

    CI/CD for test, build, and deploy workflows directly in your repo.

    MLOps
  • Terraform

    Provisioned

    Infrastructure-as-code for provisioning cloud resources reproducibly.

    MLOps
  • Helm

    Provisioned

    Package manager for Kubernetes: parameterised, versioned app deployments.

    MLOps
  • Argo CD

    Provisioned

    GitOps continuous delivery for Kubernetes — the manifest in the repo is the truth.

    MLOps
  • Grafana

    Used

    Dashboards + alerting over metrics, logs, and traces. The visualisation layer.

    MLOpsData
  • Prometheus

    Used

    Time-series metrics database with pull-based collection and PromQL.

    MLOps
  • Datadog

    Used

    Unified observability platform — metrics, traces, logs, RUM, and more.

    MLOps
  • Sentry

    Used

    Error tracking + performance monitoring for applications and LLM features.

    MLOpsAI
  • OpenTelemetry

    Used

    Vendor-neutral standard for traces, metrics, and logs across stacks.

    MLOps
  • Jaeger

    Used

    Distributed tracing system for following requests across services.

    MLOps
  • LangChain

    Used

    Framework for LLM applications: chains, tools, retrieval, agents.

    AIAgents
  • LlamaIndex

    Used

    Data framework for LLM apps — ingestion, indexing, retrieval, query engines.

    AIAgents
  • OpenAI SDK

    Used

    Official client library for the OpenAI HTTP API — installs as a package.

    AIAgents
  • Anthropic SDK

    Used

    Official client library for the Anthropic HTTP API — installs as a package.

    AIAgentsSafety
  • Ollama

    Used

    Run open-weight LLMs locally with one binary. Great for air-gapped evaluation.

    AISafety
  • AutoGen

    Used

    Multi-agent orchestration framework with conversational and tool-use primitives.

    Agents
  • CrewAI

    Used

    Role-based multi-agent framework with explicit tasks, tools, and memory.

    Agents
  • LangGraph

    Used

    Graph-based agent orchestration with typed state and durable execution.

    Agents
  • LangSmith

    Used

    Trace + evaluate LLM-application runs with structured datasets and graders.

    AIAgents
  • Guardrails AI

    Used

    Structured output validation and safety checks for LLM applications.

    SafetyAI
  • SHAP

    Used

    Shapley-value feature attribution for any model. The interpretability default.

    SafetyML
  • LIME

    Used

    Local interpretable model-agnostic explanations for individual predictions.

    Safety
  • Great Expectations

    Used

    Declarative data-quality tests that travel with your pipelines.

    DataSafety
  • Bleach

    Used

    HTML sanitisation for LLM output bound for user-facing surfaces.

    Safety
  • Pydantic

    Used

    Runtime data validation via type annotations. The backbone of safe JSON APIs.

    AISafetyData
  • pytest

    Used

    Python test framework with fixtures, parametrisation, and plugin ecosystem.

    DataMLAIMLOpsSafety
  • Trivy

    Used

    Open-source vulnerability scanner for containers, filesystems, and IaC.

    MLOpsSafety
  • Istio

    Provisioned

    Service mesh for traffic management, security, and observability in Kubernetes.

    MLOps
  • Envoy

    Provisioned

    High-performance edge and service proxy — the data plane behind many meshes.

    MLOps
  • Vault

    Provisioned

    Secrets management, encryption-as-a-service, and identity-based access.

    MLOpsSafety
  • Consul

    Provisioned

    Service discovery, service mesh, and config management for distributed systems.

    MLOps
  • NGINX

    Used

    Web server, reverse proxy, and load balancer that quietly runs the internet.

    MLOps
  • FastAPI

    Used

    Python web framework with type-driven validation. Great for AI-app backends.

    AIMLOps
  • Node.js

    Used

    JavaScript runtime for backends, serverless functions, and LLM-app BFFs.

    AI

Don’t see your tool? Custom missions can incorporate additional tools through the platform’s extensibility system. Talk to us.

Faculty

Tools alone don’t graduate an engineer.

Named reviewers grade how you use these tools across every school.

One catalog. Every skill you need to ship.

Real tools, real workflows, real reviews from named faculty.