Hacker News — vinext + Netlify

new
past
show
ask
show
jobs
submit

▲Parakeet.cpp – Parakeet ASR inference in pure C++ with Metal GPU acceleration (github.com)

32 points by noahkay13 4 hours ago | 4 comments

noahkay13 4 hours ago [-]

I built a C++ inference engine for NVIDIA's Parakeet speech recognition models using Axiom(https://github.com/Frikallo/axiom) my tensor library.

What it does: - Runs 7 model families: offline transcription (CTC, RNNT, TDT, TDT-CTC), streaming (EOU, Nemotron), and speaker diarization (Sortformer) - Word-level timestamps - Streaming transcription from microphone input - Speaker diarization detecting up to 4 speakers

nullandvoid 12 minutes ago [-]

I've been using handy with parakeet on both Windows and mac, and have been very impressed.

Hoe does this compare?

cakealert 32 minutes ago [-]

You should consider offering a python library.

With models like these often you want to glue things together and manage multithreaded queues. And gluing in cpp is no fun.

This assumes that your offering will perform better than onnxruntime on at least some metric such as memory.

ghostpepper 3 hours ago [-]

Off topic but if anyone is looking for a nice web-GUI frontend for a locally-hosted transcription engine, Scriberr is nice

https://github.com/rishikanthc/Scriberr

Rendered at 07:48:12 GMT+0000 (Coordinated Universal Time) with Netlify.