Polli Linnaeus Documentation Hub

Welcome to the official documentation for Polli Linnaeus, an open-source deep learning framework designed for taxonomic image classification and biodiversity monitoring applications. This hub provides a central point for accessing all documentation resources, whether you're looking to quickly use a pre-trained model, train your own custom models, or dive deep into the framework's architecture.

Getting Started

New to Polli Linnaeus? These guides will help you get up and running.

Project README: Start here for a general overview of the project, its goals, and quick installation/usage notes.
Installation Guide: Detailed instructions for installing Polli Linnaeus and its dependencies.
Getting Started Tutorial: A beginner-friendly introduction to the basic functionalities of Linnaeus.

Using Pre-trained Models

Leverage our suite of pre-trained models for your classification tasks.

Model Zoo: Discover available pre-trained models, their taxonomic scope, and links to Hugging Face Hub.
Running Inference with Pre-trained Models: A step-by-step tutorial on how to use our pre-trained models for inference on your own images.
Inference Overview: Learn about the core components of the Linnaeus inference pipeline, including the LinnaeusInferenceHandler.

Training Models

Guides for researchers and developers looking to train or fine-tune models.

Training Your First Polli Linnaeus Model: A comprehensive tutorial on preparing your custom dataset (HDF5 format), configuring experiments, and launching training runs.
Training Overview: An overview of the training system, including data loading, augmentations, multi-task learning, and more.
Data Loading for Training: Detailed information on how Linnaeus handles data, with a focus on the HDF5 format.
Phase 2: Abstention Training with RL (Experimental): Learn how to fine-tune pre-trained models to learn abstention behavior using Reinforcement Learning.

Datasets

Understanding the data used to train and evaluate models.

Official Dataset Provenance (ibrida-v0-r1): Detailed information on the provenance and filtering logic used to create the initial batch of pre-trained models from the iNaturalist Open Data. This is crucial for understanding model scope and limitations.

Advanced Topics

For users who want to explore more specialized features of Linnaeus.

Advanced Topics Index: Links to guides on:
- Automatic Batch Sizing
- Development Features & Debug Flags
- Hierarchical Approaches (Taxonomy-Guided Label Smoothing, Hierarchical Heads)
- Taxonomy Representation
- Training Progress Tracking

Developer Documentation

Resources for those looking to extend or contribute to Polli Linnaeus.

Model System Overview: Understand the architectural design, component registries, and how to create new model architectures.
CI & Docker Guide: Learn about the continuous integration setup and Docker image architecture.
Docker Build Guide: Detailed documentation on building and maintaining the Linnaeus Docker images.
Development Guides: (Link to docs/dev/ directory if it contains an index or relevant files - e.g., docs/dev/01_training_loop_and_progress.md) Explore notes on the training loop, scheduling system, metrics, and other internal design choices.
Contributing Guidelines: (To be created) How to contribute to the Polli Linnaeus project.

Known Limitations

Known Limitations: Current limitations and known issues in Polli Linnaeus, along with recommended workarounds.

If you can't find what you're looking for, please consider opening an issue on our GitHub repository.