AI & ML interests

Data for Everyone

ZennyKenny
posted an update 16 days ago
🍓 One of the coolest parts about being an early Strawberry user has been the opportunity to build on the app at the ground floor.

The platform already has a ton of great integrations that let you interact with your external apps directly with tools, but I wanted to add the ability to do stuff in Slack as well.

💪 So I took the base Anthropic Slack MCP server, added a whole bunch of new tools, generalized it as an HTTP-based SSE server, and deployed it in like 2 minutes with Railway so that Strawberry (or Claude, or any other MCP client) could make use of it.

Now you can Chat with your Strawberry Companion (or Claude, or whatever) and do things like:
➡️ Get caught up across all of your Slack channels after a long weekend or noisy incident without having to read 20 threads in 10 different channels
➡️ Create, read, and edit Canvases, Messages, and Channels
➡️ Take any resources or content that you're using in your Chat and inject it directly into Slack without copy / paste

😎 I'm pretty pleased with the results, and I made a short demo video of the work (link in the comments). The best part is that it's available on GitHub for anyone else to use too (link in the comments, instructions in the README). The setup takes about 5-10 minutes.
ZennyKenny
posted an update 28 days ago
What a trip. Just walked through @burtenshaw and @evalstate's tutorial on adding Hugging Face Skills to your Claude Code agent so you can fine-tune LLMs by chatting with AI.

These are the kinds of innovations that are going to help everyone benefit from the power of Artificial Intelligence. Well done, gentlemen, and thank you for sharing.
ZennyKenny
posted an update about 1 month ago
😐 I keep seeing takes on LinkedIn from American business influencers melting down about Silicon Valley startup "dependence" on open-source Chinese models.

🤔 Can anyone describe a credible scenario where the Chinese government could leverage these models to endanger American security interests, or am I right to believe that this is just Red Scare nonsense?
ZennyKenny
posted an update about 1 month ago
The #feedback channels of early-access app Slack workspaces are some of the best unintentional comedy material I have ever come across tbh.
ZennyKenny
posted an update about 2 months ago
🎉 Wow. Congratulations @bfirsh and the Replicate team on the Cloudflare acquisition!

✌️ You've really built an incredible ecosystem and product offering and should be super proud.
ZennyKenny
posted an update about 2 months ago
🎉 Novoyaz is live.

A few months ago, I built a quick POC on Hugging Face using a variant of OpenAI's OSS-20B model that I fine-tuned to convert the text of pre-reform Russian-language documents into modern Russian orthography.

⚡️ This morning, I launched novoyaz.io.

This is a production app (I built the frontend in like two hours with Lovable) that uses the same fine-tuned model for transliteration but adds a bunch of extra features that make it even easier to use, like taking and uploading pictures with your on-device camera 😅.

👉 If you're a researcher, or know a researcher, whose day-to-day workflow this app would improve, please get in touch with me.
ZennyKenny
posted an update 2 months ago
Anyone got the scoop on a good OCR model that's available on inference?

Keen to make use of an endpoint (gated or not -- happy to pay for usage) for a personal project, but not so keen to pay for the GPU hosting myself.

🙈🙈🙈
ZennyKenny
posted an update 3 months ago
Did Hugging Face just ban hammer a bunch of bot accounts or am I just so uninteresting that 30% of my subs dropped me overnight?

😬 Wait, don't answer that.
ZennyKenny
posted an update 3 months ago
🥊 Big Code Arena is live! bigcode/arena

💡 bigcode is an open scientific collaboration working on the responsible training of large language models for coding applications.

👉 The Arena ranks LLMs in a competitive format on their ability to handle natural-language vibe coding requests, using feedback from human reviewers.

🧠 It was a pleasure to contribute to this project led by @terryyz and appear as an additional contributor in the Big Code Arena paper.
ZennyKenny
posted an update 3 months ago
🖤 Probably one of my favorite projects that I've worked on so far, introducing Новояз (Novoyaz).

🛠 One of the first acts of the Bolshevik government after the Russian Revolution was the reform and standardization of the Russian language, which at the time had a non-standard and challenging orthography.

📚 Following the reform, the government launched a nationwide campaign called Ликбез (Likbez), which sought to improve literacy in the country (by the way, it worked, bringing the national literacy rate from <20% in the 1920s to >80% by the 1930s).

‼ While this is a remarkable result that should absolutely be celebrated, it's one that has left behind literally hundreds of thousands if not millions of artifacts using pre-reform Russian orthography.

😓 Researchers and historians are working tirelessly to translate these artifacts to modern Russian so that they may be archived and studied, but many have told me that they are doing this BY HAND (!).

💡 I thought, well, this is a perfect use case for OCR and a fine-tuned LLM to step in and aid this important work!

🌍 Introducing НОВОЯЗ (NOVOYAZ)! Powered by ChatDOC/OCRFlux-3B and https://huggingface.co/ZennyKenny/oss-20b-prereform-to-modern-ru-merged, researchers can now convert images of their pre-reform documents to modern Russian orthography using the power of open-source AI!

Check it out and drop a like to support more real-world use cases for open source AI outside of traditional tech-centric domains!

ZennyKenny/Novoyaz
ZennyKenny
posted an update 3 months ago
🔒 Like a lot of other AI builders, I have some anxiety about the surveillance-capitalist paradigm emerging in the AI space.

👉 Of course, this kind of thing isn't completely new and has been going on for decades, but the difference is how much more deeply AI tools are embedded in our daily lives (compared to something like a search engine or social network).

❕ That's why I was really excited to come across Lumo: https://lumo.proton.me/u/1/

❕ Lumo is created by ProtonPrivacy and offers privacy-first features that make sure that what you do with your AI assistant is your business.

❕ I already trust Proton with my other business apps and I've never been disappointed. Plus, the Lumo architecture is really fantastic, dynamically routing each query to the most appropriate model for the request.

🔥 Really awesome stuff, Proton. Thank you as always.
ZennyKenny
posted an update 3 months ago
The reactions to mostlyai/synthetic-sdk-demo have been incredible! 🔥

Some users wrote that they were having performance issues on larger datasets, so I've capped the Space's input to 5000 rows and 10 columns, but you can always use the open source SDK that powers the Space on datasets of arbitrary size and shape!

Check it out: https://github.com/mostly-ai/mostlyai 👈
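
For anyone curious what an input cap like the one above might look like, here is a minimal sketch in plain Python. It is not the Space's actual code: `cap_table`, `MAX_ROWS`, and `MAX_COLS` are hypothetical names, and it assumes the cap simply keeps the first 5000 rows and first 10 columns (the real Space may instead reject oversized uploads).

```python
# Hypothetical sketch of an input cap; names and truncation behavior are assumptions.
MAX_ROWS = 5000  # row limit mentioned in the post
MAX_COLS = 10    # column limit mentioned in the post


def cap_table(header, rows, max_rows=MAX_ROWS, max_cols=MAX_COLS):
    """Return (header, rows) truncated to at most max_cols columns and max_rows rows."""
    capped_header = list(header[:max_cols])
    capped_rows = [list(row[:max_cols]) for row in rows[:max_rows]]
    return capped_header, capped_rows


if __name__ == "__main__":
    # Build a table that exceeds both limits: 12 columns, 6000 rows.
    header = [f"col_{i}" for i in range(12)]
    rows = [[r * 100 + c for c in range(12)] for r in range(6000)]
    capped_header, capped_rows = cap_table(header, rows)
    print(len(capped_header), len(capped_rows), len(capped_rows[0]))
```

Tables already within the limits pass through unchanged, so the same function can sit in front of every upload.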
ZennyKenny
posted an update 4 months ago
The open source Synthetic Data SDK from MOSTLY AI (mostlyai) offers the ability to generate realistic, privacy-safe synthetic data with just a few lines of Python.

Try it out yourself in a No Code UI in the SDK Demo Space: mostlyai/synthetic-sdk-demo