Kyle's picture

Kyle PRO

iky1e

·

https://ikyle.me

kylehowells

AI & ML interests

None yet

Recent Activity

updated a collection about 6 hours ago

Super-Resolution

updated a collection about 6 hours ago

Super-Resolution

liked a model about 6 hours ago

Acly/Real-ESRGAN-GGUF

View all activity

Organizations

upvoted a paper about 6 hours ago

One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution

Paper • 2511.17138 • Published Nov 21, 2025 • 2

upvoted a paper about 8 hours ago

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Paper • 2504.16030 • Published Apr 22, 2025 • 37

upvoted a collection 1 day ago

AuraSR

Fastest super resolution model for AI generated images • 2 items • Updated Jul 30, 2024 • 7

upvoted 2 papers 1 day ago

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Paper • 2509.24650 • Published Sep 29, 2025 • 6

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 137

upvoted a paper 7 days ago

OmniVoice: Towards Omnilingual Zero-Shot Text-to-Speech with Diffusion Language Models

Paper • 2604.00688 • Published 9 days ago • 7

upvoted a collection 7 days ago

Gemma 4

8 items • Updated 8 days ago • 536

upvoted a collection 8 days ago

Bonsai

1-bit Bonsai models • 6 items • Updated 10 days ago • 162

upvoted an article 9 days ago

Article

Speculative Decoding for 2x Faster Whisper Inference

Dec 20, 2023

•

32

upvoted a paper 12 days ago

Voxtral TTS

Paper • 2603.25551 • Published 15 days ago • 57

upvoted a collection 13 days ago

UnifoLM_WBT_Dataset

8 items • Updated 14 days ago • 78

upvoted an article 14 days ago

Article

Introducing Cohere-transcribe: state-of-the-art speech recognition

15 days ago

•

35

upvoted a paper 24 days ago

Canary-1B-v2 & Parakeet-TDT-0.6B-v3: Efficient and High-Performance Models for Multilingual ASR and AST

Paper • 2509.14128 • Published Sep 17, 2025 • 2

upvoted a collection 24 days ago

Demucs MLX — Music Source Separation

Demucs music stem separation for Apple Silicon. Float32 and float16 variants. • 2 items • Updated 24 days ago • 1

upvoted a collection 25 days ago

Granite Speech Models

Multilingual ASR and speech-to-text (STT) models for enterprise transcription and translation. • 6 items • Updated 9 days ago • 24

upvoted a collection 27 days ago

DeepFilterNet-MLX

MLX ports of the DeepFilterNet speech enhancement models for Apple Silicon • 7 items • Updated 27 days ago • 1

upvoted 4 papers 27 days ago

DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio based on Deep Filtering

Paper • 2110.05588 • Published Oct 11, 2021 • 1

DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band Audio

Paper • 2205.05474 • Published May 11, 2022 • 1

DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement

Paper • 2305.08227 • Published May 14, 2023 • 2

Sasha: Creative Goal-Oriented Reasoning in Smart Homes with Large Language Models

Paper • 2305.09802 • Published May 16, 2023 • 1