    Repositories list

    • Command line utility for forced alignment using Kaldi
      Python
      2651.6k2661 Updated Sep 25, 2025
    • Demo corpus for THCHS-30
      0000 Updated Sep 24, 2025
    • Demo corpus for JVS
      0000 Updated Sep 24, 2025
    • Demo corpus for LibriSpeech
      0000 Updated Sep 22, 2025
    • kalpy

      Public
      Pybind11 bindings for Kaldi
      Jupyter Notebook
      31400 Updated Sep 17, 2025
    • Aligned version of LibriSpeech test-clean for use in benchmarking English forced alignment
      0000 Updated Sep 14, 2025
    • PolyglotDB is a package for phonetic corpus storage and analysis
      Python
      1748412 Updated Aug 14, 2025
    • Multivariate modeled overlap: a method for measuring vowel merger
      HTML
      0000 Updated Jul 25, 2025
    • Anchor annotator is a program for inspecting corpora for the Montreal Forced Aligner and correcting transcriptions and pronunciations
      Python
      1510 Updated Jul 14, 2025
    • 0000 Updated Jul 14, 2025
    • Collection of pretrained models for the Montreal Forced Aligner
      Python
      26167250 Updated Jun 16, 2025
    • Scripts and pipeline for determining a usable subset of the AudioBNC corpus
      Python
      0002 Updated Jul 6, 2023
    • SPADE

      Public
      Anything SPADE-related not covered in another repository, including scripts for analyzing SPADE datasets using PolyglotDB.
      R
      4141 Updated Dec 27, 2022
    • ISCAN

      Public
      Development repository for Integrated Speech Corpus Analysis (ISCAN)
      JavaScript
      010332 Updated Jul 15, 2022
    • Django server set up for the SPADE project
      Python
      0111 Updated Jan 6, 2022
    • 0000 Updated Oct 12, 2021
    • Phonetisaurus G2P
      Python
      125100 Updated Jun 22, 2021
    • Read-only unofficial mirror of the OpenGrm NGram Library
      C++
      3000 Updated Jun 22, 2021
    • Benchmarking suites for PolyglotDB
      Python
      2001 Updated Jun 22, 2021
    • resources for SCT (this should be made private)
      Python
      3000 Updated Jun 22, 2021
    • Test datasets for the Montreal Forced Aligner
      0100 Updated Jun 22, 2021
    • Easier analysis of large speech corpora
      Python
      82350 Updated Jun 22, 2021
    • kaldi

      Public
      This is now the official location of the Kaldi project.
      Shell
      5.4k100 Updated Jun 22, 2021
    • Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner
      Python
      64410 Updated Jun 22, 2021