Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
48.7
TFLOPS
5
22
34
Nicolay Rusnachenko
nicolay-r
Follow
Jwrockon's profile picture
georgefreedland's profile picture
Bazsux's profile picture
119 followers
·
4 following
https://nicolayr.com/
nicolayr_
nicolay-r
nicolay-r
nicolay-r.bsky.social
AI & ML interests
Information Retrieval・Medical Multimodal NLP (🖼+📝) Research Fellow @BU_Research・software developer http://arekit.io・PhD in NLP
Recent Activity
posted
an
update
about 6 hours ago
📢 Several weeks ago Microsoft announced Phi-4. My most-recent list of LLM models have had only wrapper for Phi-2, so it was time to update! With this post, happy to share that Phi-4 wrapper is now available at nlp-thirdgate for adopting Chain-of-Thought reasoning: 🤖 https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/transformers_phi4.py 📒 https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_phi4.py Findings on adaptation: I was able to reproduce only the pipeline based model launching. This version is for textual llm only. Microsoft also released multimodal Phi-4 which is out of scope of this wrapper. 🌌 nlp-thirdgate: https://lnkd.in/ef-wBnNn
posted
an
update
1 day ago
📢 Delighted to announce the updated version of the no-string framework for chain-of-thought application over JSONL/CSV data: https://github.com/nicolay-r/bulk-chain/releases/tag/0.25.2 🔧 Fixes: - Fixed issues with batching mode - Fixed problem with parsing and passing args in shell mode ⚠️ Limitation: bathing mode is still available only via API. 📒 Quick Start with Gemma-3 in batching mode: https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_gemma_3.ipynb
replied
to
their
post
2 days ago
📢 With the recent release of Gemma-3, If you interested to play with textual chain-of-though, the notebook below is a wrapper over the the model (native transformers inference API) for passing the predefined schema of promps in batching mode. https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_gemma_3.ipynb Limitation: schema supports texts only (for now), while gemma-3 is a text+image to text. Model: https://huggingface.co/google/gemma-3-1b-it Provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/transformers_gemma3.py
View all activity
Organizations
None yet
nicolay-r
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
3 models
3 days ago
google/gemma-3-12b-it
Image-Text-to-Text
•
Updated
4 days ago
•
65.6k
•
197
google/gemma-3-4b-it
Image-Text-to-Text
•
Updated
4 days ago
•
79.8k
•
204
google/gemma-3-1b-it
Text Generation
•
Updated
4 days ago
•
49.9k
•
155
liked
a model
5 days ago
meta-llama/Llama-3.2-1B-Instruct
Text Generation
•
Updated
Oct 24, 2024
•
2.7M
•
•
821
liked
a model
23 days ago
Qwen/Qwen2.5-3B-Instruct
Text Generation
•
Updated
Sep 25, 2024
•
652k
•
•
214
liked
2 models
about 1 month ago
facebook/mgenre-wiki
Text2Text Generation
•
Updated
Jan 24, 2023
•
950
•
•
28
sapienzanlp/relik-entity-linking-base
Updated
Aug 7, 2024
•
41
•
3
liked
a dataset
about 1 month ago
open-thoughts/OpenThoughts-114k
Viewer
•
Updated
24 days ago
•
228k
•
75.5k
•
654
liked
a Space
about 1 month ago
Running
558
558
Qwen2.5 Max Demo
🐢
Chat with an AI language model
liked
2 models
about 2 months ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation
•
Updated
20 days ago
•
1.25M
•
548
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
20 days ago
•
2.07M
•
•
11.4k
liked
a Space
2 months ago
Running
on
CPU Upgrade
359
359
Open Medical-LLM Leaderboard
🥇
Browse and submit LLM evaluations
liked
a model
2 months ago
johnsnowlabs/JSL-MedLlama-3-8B-v2.0
Text Generation
•
Updated
Apr 30, 2024
•
4.05k
•
33
liked
a model
6 months ago
meta-llama/Llama-3.2-3B-Instruct
Text Generation
•
Updated
Oct 24, 2024
•
2.91M
•
•
1.22k
liked
a model
8 months ago
hyy-33/hyy33-WASSA-2024-Track-2
Updated
Jul 9, 2024
•
2
liked
3 models
9 months ago
google/gemma-2-9b-it
Text Generation
•
Updated
Aug 27, 2024
•
260k
•
•
687
google/gemma-2-27b-it
Text Generation
•
Updated
Aug 27, 2024
•
146k
•
•
541
Qwen/Qwen2-7B-Instruct
Text Generation
•
Updated
Aug 21, 2024
•
262k
•
•
621
liked
2 models
10 months ago
mistralai/Mistral-7B-Instruct-v0.3
Text Generation
•
Updated
Aug 21, 2024
•
951k
•
•
1.49k
microsoft/Phi-3-small-8k-instruct
Text Generation
•
Updated
Aug 30, 2024
•
43.7k
•
164
Load more