Fast graph RAG [https://github.com/circlemind-ai/fast-graphrag] - 2024-11-19 04:00:04 - public:mikeinmass machine-learning, work - 2 | id:1510374 -
Baidu Pan downloader [https://baidu.erranium.com/] - 2024-09-12 13:49:58 - public:mikeinmass internet, machine-learning, work - 3 | id:1492960 - Download from Baidu without installing their software
Transformer Architecture: The Positional Encoding - Amirhossein Kazemnejad's Blog [https://kazemnejad.com/blog/transformer_architecture_positional_encoding/] - 2024-08-08 17:12:40 - public:mikeinmass machine-learning, work - 2 | id:1492724 -
wav2letter/src/libraries/decoder/LexiconFreeDecoder.cpp at be863bb941108e95545b94fdf192722699295c63 · flashlight/wav2letter [https://github.com/flashlight/wav2letter/blob/be863bb941108e95545b94fdf192722699295c63/src/libraries/decoder/LexiconFreeDecoder.cpp#L70] - 2024-03-08 21:25:19 - public:mikeinmass speech, work - 2 | id:1489862 -
GitHub - kensho-technologies/pyctcdecode: A fast and lightweight python-based CTC beam search decoder for speech recognition. [https://github.com/kensho-technologies/pyctcdecode] - 2024-03-08 21:25:09 - public:mikeinmass speech, work - 2 | id:1489861 -
Mindee DocTR OCR [https://mindee.github.io/doctr/using_doctr/using_datasets.html] - 2024-01-24 05:01:26 - public:mikeinmass ocr, work - 2 | id:1489363 -
Text Detection Forgot About Document OCR [https://arxiv.org/abs/2210.07903] - 2024-01-23 23:38:27 - public:mikeinmass ocr, work - 2 | id:1489362 -
Attention vs kernel smoothing [https://news.ycombinator.com/item?id=38756888] - 2023-12-25 04:28:45 - public:mikeinmass machine-learning, work - 2 | id:1488928 -
MhLiao/DB: A PyTorch implementation of “Real-time Scene Text Detection with Differentiable Binarization“. [https://github.com/MhLiao/DB] - 2023-10-23 22:38:25 - public:mikeinmass ocr, work - 2 | id:1484970 -
paddlespeech.s2t.decoders.ctcdecoder.decoders_deprecated — paddle speech 2.1 documentation [https://paddlespeech.readthedocs.io/en/latest/_modules/paddlespeech/s2t/decoders/ctcdecoder/decoders_deprecated.html#ctc_beam_search_decoder] - 2023-10-19 17:44:27 - public:mikeinmass machine-learning, work - 2 | id:1484929 - Simple CTC beam search decoder
Prophet and time series prediction discussion [https://news.ycombinator.com/item?id=37663820] - 2023-09-27 02:15:00 - public:mikeinmass anomaly-detection, machine-learning, work - 3 | id:1484682 - Anomaly detection in time series good links
Handmade Transformer [https://vgel.me/posts/handmade-transformer/] - 2023-09-22 23:34:26 - public:mikeinmass machine-learning, work - 2 | id:1484640 -
On lattice free MMI and Chain models in Kaldi [https://desh2608.github.io/2019-05-21-chain/] - 2023-02-21 19:56:10 - public:mikeinmass kaldi, work - 2 | id:1315473 -
Multiple Attribute Text Style Transfer [https://github.com/clock-me/text-restyle/blob/main/sandbox.ipynb] - 2023-02-15 15:39:56 - public:mikeinmass python, work - 2 | id:1301854 -
pyannote audio diarizer [https://github.com/pyannote/pyannote-audio] - 2023-02-15 15:38:08 - public:mikeinmass python, speech, work - 3 | id:1301853 -
Self-Attentive VAD: Context-Aware Detection of Voice from Noise [https://github.com/voithru/voice-activity-detection] - 2023-02-15 15:27:24 - public:mikeinmass speech, work - 2 | id:1301849 -
pyctcdecode: a new beam search decoder for ctc [https://blog.kensho.com/pyctcdecode-a-new-beam-search-decoder-for-ctc-speech-recognition-2be3863afa96] - 2023-02-15 15:14:14 - public:mikeinmass speech, work - 2 | id:1301848 -
Self-Attentive VAD: Context-Aware Detection of Voice from Noise [https://ieeexplore.ieee.org/document/9413961] - 2023-02-15 15:13:26 - public:mikeinmass speech, work - 2 | id:1301847 -
AsoSoft Kurdish speech corpus [https://github.com/AsoSoft/AsoSoft-Speech-Corpus] - 2023-02-15 15:05:37 - public:mikeinmass speech, work - 2 | id:1301846 -
[2207.06057] Subband-based Generative Adversarial Network for Non-parallel Many-to-many Voice Conversion [https://arxiv.org/abs/2207.06057] - 2022-09-16 22:24:19 - public:mikeinmass work - 1 | id:1276611 -
Linear Predictive Coding is All-Pole Resonance Modeling [https://ccrma.stanford.edu/~hskim08/lpc/] - 2022-09-16 22:22:27 - public:mikeinmass speech, work - 2 | id:1276610 -
mvansegbroeck-zz/featxtra: Kaldi Speech Processing Tools [https://github.com/mvansegbroeck-zz/featxtra] - 2022-08-05 21:00:44 - public:mikeinmass kaldi, work - 2 | id:1222028 -
PracticalDeepLearningPython/chapter_15 at main · rkneusel9/PracticalDeepLearningPython [https://github.com/rkneusel9/PracticalDeepLearningPython/tree/main/chapter_15] - 2022-08-05 20:58:21 - public:mikeinmass machine-learning, work - 2 | id:1222026 -
[2112.02191] NN-LUT: Neural Approximation of Non-Linear Operations for Efficient Transformer Inference [https://arxiv.org/abs/2112.02191] - 2022-07-26 18:35:30 - public:mikeinmass machine-learning, work - 2 | id:1221902 -
KAIST Scene Text Ground Truth (text location, segmantation and recognition) - TC11 [http://www.iapr-tc11.org/mediawiki/index.php?title=KAIST_Scene_Text_Ground_Truth_(text_location,_segmantation_and_recognition)] - 2022-07-26 18:30:00 - public:mikeinmass ocr, work - 2 | id:1221901 -
Transformers: a Primer [http://www.columbia.edu/~jsl2239/transformers.html#dot_attention] - 2022-07-26 16:49:15 - public:mikeinmass machine-learning, work - 2 | id:1221897 -
Reading | VOiCES [https://iqtlabs.github.io/voices/downloads/] - 2022-07-26 16:05:52 - public:mikeinmass speech, work - 2 | id:1221894 -
Switchboard [https://isip.piconepress.com/projects/switchboard/] - 2022-07-26 15:56:46 - public:mikeinmass speech, work - 2 | id:1221893 -
Tutorial - What is a variational autoencoder? – Jaan Altosaar [https://jaan.io/what-is-variational-autoencoder-vae-tutorial/] - 2022-07-26 14:51:38 - public:mikeinmass machine-learning, work - 2 | id:1221886 -
Understanding VQ-VAE (DALL-E Explained Pt. 1) - ML@B Blog [https://ml.berkeley.edu/blog/posts/vq-vae/] - 2022-07-20 15:46:33 - public:mikeinmass work - 1 | id:1221793 -
kaldi-fork-active-grammar/laf-sub-nnet3.cc at master · daanzu/kaldi-fork-active-grammar · GitHub [https://github.com/daanzu/kaldi-fork-active-grammar/blob/master/src/dragonfly/laf-sub-nnet3.cc] - 2022-02-10 20:16:14 - public:mikeinmass kaldi, work - 2 | id:1022009 -
Grammar fst class template by all2ham · Pull Request #4067 · kaldi-asr/kaldi · GitHub [https://github.com/kaldi-asr/kaldi/pull/4067] - 2022-02-10 20:16:02 - public:mikeinmass kaldi, work - 2 | id:1022008 -
what G.fst to use when rescoring a lattice made by a make-grammar-fst created graph [https://groups.google.com/g/kaldi-help/c/W-r2o8mBVi4] - 2022-02-10 14:04:15 - public:mikeinmass kaldi, work - 2 | id:1021996 -
python - How can I host my own private conda repository? - Stack Overflow [https://stackoverflow.com/questions/35359147/how-can-i-host-my-own-private-conda-repository] - 2022-02-02 21:13:13 - public:mikeinmass python, work - 2 | id:1021842 -
Cosine Similarity is Euclidean Distance · [https://skeptric.com/cosine-is-euclidean/] - 2022-01-05 21:40:22 - public:mikeinmass math, work - 2 | id:980495 -