Nicolay Rusnachenko

nicolay-r

AI & ML interests

Information Retrieval・Medical Multimodal NLP (🖼+📝) Research Fellow @BU_Research・software developer http://arekit.io・PhD in NLP

Recent Activity

posted an update about 7 hours ago

📢 Several weeks ago Microsoft announced Phi-4. My most-recent list of LLM models have had only wrapper for Phi-2, so it was time to update! With this post, happy to share that Phi-4 wrapper is now available at nlp-thirdgate for adopting Chain-of-Thought reasoning: 🤖 https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/transformers_phi4.py 📒 https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_phi4.py Findings on adaptation: I was able to reproduce only the pipeline based model launching. This version is for textual llm only. Microsoft also released multimodal Phi-4 which is out of scope of this wrapper. 🌌 nlp-thirdgate: https://lnkd.in/ef-wBnNn

posted an update 1 day ago

📢 Delighted to announce the updated version of the no-string framework for chain-of-thought application over JSONL/CSV data: https://github.com/nicolay-r/bulk-chain/releases/tag/0.25.2 🔧 Fixes: - Fixed issues with batching mode - Fixed problem with parsing and passing args in shell mode ⚠️ Limitation: bathing mode is still available only via API. 📒 Quick Start with Gemma-3 in batching mode: https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_gemma_3.ipynb

replied to their post 2 days ago

📢 With the recent release of Gemma-3, If you interested to play with textual chain-of-though, the notebook below is a wrapper over the the model (native transformers inference API) for passing the predefined schema of promps in batching mode. https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_gemma_3.ipynb Limitation: schema supports texts only (for now), while gemma-3 is a text+image to text. Model: https://huggingface.co/google/gemma-3-1b-it Provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/transformers_gemma3.py

View all activity

Organizations

None yet

nicolay-r's activity

liked 3 models 3 days ago

liked a model 5 days ago

meta-llama/Llama-3.2-1B-Instruct

Text Generation • Updated Oct 24, 2024 • 2.7M • • 822

liked a model 23 days ago

Qwen/Qwen2.5-3B-Instruct

Text Generation • Updated Sep 25, 2024 • 652k • • 214

liked 2 models about 1 month ago

facebook/mgenre-wiki

Text2Text Generation • Updated Jan 24, 2023 • 950 • • 28

sapienzanlp/relik-entity-linking-base

Updated Aug 7, 2024 • 41 • 3

liked a dataset about 1 month ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 24 days ago • 228k • 75.5k • 654

liked a Space about 1 month ago

558

Qwen2.5 Max Demo

🐢

Chat with an AI language model

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated 20 days ago • 1.25M • 548

deepseek-ai/DeepSeek-R1

Text Generation • Updated 20 days ago • 2.07M • • 11.4k

liked a Space 2 months ago

359

Open Medical-LLM Leaderboard

🥇

Browse and submit LLM evaluations

liked a model 2 months ago

johnsnowlabs/JSL-MedLlama-3-8B-v2.0

Text Generation • Updated Apr 30, 2024 • 4.05k • 33

liked a model 6 months ago

meta-llama/Llama-3.2-3B-Instruct

Text Generation • Updated Oct 24, 2024 • 2.91M • • 1.22k

liked a model 8 months ago

hyy-33/hyy33-WASSA-2024-Track-2

Updated Jul 9, 2024 • 2

liked 3 models 9 months ago

google/gemma-2-9b-it

Text Generation • Updated Aug 27, 2024 • 260k • • 687

google/gemma-2-27b-it

Text Generation • Updated Aug 27, 2024 • 146k • • 541

Qwen/Qwen2-7B-Instruct

Text Generation • Updated Aug 21, 2024 • 262k • • 621

liked 2 models 10 months ago

mistralai/Mistral-7B-Instruct-v0.3

Text Generation • Updated Aug 21, 2024 • 951k • • 1.49k

microsoft/Phi-3-small-8k-instruct

Text Generation • Updated Aug 30, 2024 • 43.7k • 164