Open-Orca (OpenOrca)

Nymbo

posted an update about 24 hours ago

Haven't seen this posted anywhere - Llama-3.3-8B-Instruct is available on the new Llama API. Is this a new model or did someone mislabel Llama-3.1-8B?

Nymbo

posted an update 9 days ago

PSA for anyone using Nymbo/Nymbo_Theme or Nymbo/Nymbo_Theme_5 in a Gradio space ~

Both of these themes have been updated to fix long-standing inconsistencies that date back to the transition to Gradio v5. Textboxes are no longer bright green and inline code is readable now! Both themes are now visually identical across versions.

If your space is already using one of these themes, you just need to restart your space to get the latest version. No code changes needed.

louisbrulenaudet

posted an update about 2 months ago

I’ve just released logfire-callback on PyPI, designed to facilitate monitoring of Hugging Face Transformers training loops using Pydantic Logfire 🤗

The callback will automatically log training start with configuration parameters, periodic metrics and training completion ⏱️

Install the package using pip:

pip install logfire-callback

First, ensure you have a Logfire API token and set it as an environment variable:

export LOGFIRE_TOKEN=your_logfire_token

Then use the callback in your training code:

from transformers import Trainer, TrainingArguments
from logfire_callback import LogfireCallback

# Initialize your model, dataset, etc.

training_args = TrainingArguments(
    output_dir="./results",
    num_train_epochs=3,
    # ... other training arguments
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    callbacks=[LogfireCallback()]  # Add the Logfire callback here
)

trainer.train()

If you have any feedback, please reach out at @louisbrulenaudet

not-lain

posted an update about 2 months ago

🚀AraClip is now fully integrated with Hugging Face 🤗

AraClip is a specialized CLIP model that was created by @pain and optimized for Arabic text-image retrieval tasks🔥

🔗 Try it out 🔗
🤖 model: Arabic-Clip/araclip
🧩 Gradio demo: Arabic-Clip/Araclip-Simplified
🌐 website: https://arabic-clip.github.io/Arabic-CLIP/

Alignment-Lab-AI

updated a dataset 3 months ago

Open-Orca/OpenOrca

Viewer • Updated Feb 19 • 2.94M • 9.8k • 1.4k

louisbrulenaudet

posted an update 3 months ago

I am pleased to introduce my first project built upon Hugging Face’s smolagents framework, integrated with Alpaca for financial market analysis automation 🦙🤗

The project implements technical indicators such as the Relative Strength Index (RSI) and Bollinger Bands to provide momentum and volatility analysis. Market data is retrieved through the Alpaca API, enabling access to historical price information across various timeframes.
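
For readers unfamiliar with the two indicators, here is a minimal sketch of how they are commonly computed. It is not taken from the linked repository: the function names, the default periods (14 and 20), the simple-average RSI variant, and the use of the population standard deviation are all assumptions chosen for illustration.

```python
from statistics import mean, pstdev


def rsi(closes, period=14):
    """Relative Strength Index over the last `period` price changes.

    Uses plain averages of gains and losses (an assumed variant; smoothed
    versions such as Wilder's are also common).
    """
    if len(closes) < period + 1:
        raise ValueError("need at least period + 1 closing prices")
    window = closes[-(period + 1):]
    changes = [b - a for a, b in zip(window, window[1:])]
    avg_gain = mean(max(c, 0.0) for c in changes)
    avg_loss = mean(max(-c, 0.0) for c in changes)
    if avg_loss == 0:
        return 100.0  # no losing moves in the window (convention used here)
    rs = avg_gain / avg_loss
    return 100.0 - 100.0 / (1.0 + rs)


def bollinger_bands(closes, period=20, num_std=2.0):
    """Return (lower, middle, upper) bands for the last `period` closes."""
    if len(closes) < period:
        raise ValueError("need at least `period` closing prices")
    window = closes[-period:]
    middle = mean(window)                # simple moving average
    spread = num_std * pstdev(window)    # band half-width
    return middle - spread, middle, middle + spread
```

RSI summarizes momentum on a 0 to 100 scale, while the width of the Bollinger Bands tracks recent volatility, which is why the two are often read together.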

AI-powered insights are generated with Hugging Face’s Inference API, which analyzes market trends in natural language. DuckDuckGo search integration supplies financial news for real-time sentiment analysis 🦆

Link to the GitHub project: https://github.com/louisbrulenaudet/agentic-market-tool

arielnlee

authored 2 papers 3 months ago

From Text to Pose to Image: Improving Diffusion Model Control and Quality

Paper • 2411.12872 • Published Nov 19, 2024

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 9

not-lain

posted an update 3 months ago

I have just released a new blog post about KV caching and its role in speeding up inference 🚀
🔗 https://huggingface.co/blog/not-lain/kv-caching/
some takeaways :
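
As a rough illustration of the general idea (not code from the blog post), the sketch below keeps the keys and values of already-processed tokens in a cache, so each decoding step only has to supply the new token's vectors. The `attend` function and `KVCache` class are hypothetical names, the example is single-head, and the projections that would normally produce the query, key, and value are omitted.

```python
import math


def attend(query, keys, values):
    """Scaled dot-product attention for one query over all given positions."""
    scale = 1.0 / math.sqrt(len(query))
    scores = [scale * sum(q * k for q, k in zip(query, key)) for key in keys]
    peak = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    return [
        sum(w * value[i] for w, value in zip(weights, values)) / total
        for i in range(len(values[0]))
    ]


class KVCache:
    """Stores the keys and values of tokens that were already processed."""

    def __init__(self):
        self.keys = []
        self.values = []

    def step(self, query, key, value):
        """Append the new token's key/value, then attend over the whole prefix."""
        self.keys.append(key)
        self.values.append(value)
        return attend(query, self.keys, self.values)
```

Without the cache, every step would have to rebuild the keys and values for the entire prefix before attending; with it, the per-step work on the prefix reduces to the attention itself.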

not-lain

posted an update 4 months ago

We now have more than 2,000 public AI models using ModelHubMixin🤗

not-lain

posted an update 4 months ago

Published a new blog post 📖
In this blog post I walk through the transformer architecture, emphasizing how tensor shapes propagate through each layer.
🔗 https://huggingface.co/blog/not-lain/tensor-dims
some interesting takeaways :
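
To make the shape bookkeeping concrete, here is a small sketch (not from the blog post) that lists the tensor shapes at each stage of a generic transformer layer. The function name `transformer_shapes`, the stage labels, and the feed-forward expansion factor of 4 are assumptions for illustration; real models may differ.

```python
def transformer_shapes(batch, seq_len, d_model, num_heads, vocab_size, ffn_mult=4):
    """Return (stage, shape) pairs for one pass through a generic transformer layer."""
    if d_model % num_heads != 0:
        raise ValueError("d_model must be divisible by num_heads")
    head_dim = d_model // num_heads
    return [
        ("token ids", (batch, seq_len)),
        ("embeddings", (batch, seq_len, d_model)),
        ("q, k, v per head", (batch, num_heads, seq_len, head_dim)),
        ("attention scores", (batch, num_heads, seq_len, seq_len)),
        ("attention output", (batch, num_heads, seq_len, head_dim)),
        ("merged heads", (batch, seq_len, d_model)),
        ("feed-forward hidden", (batch, seq_len, ffn_mult * d_model)),
        ("feed-forward output", (batch, seq_len, d_model)),
        ("logits", (batch, seq_len, vocab_size)),
    ]


# Example: print the trace for a small hypothetical configuration.
for stage, shape in transformer_shapes(2, 16, 512, 8, 32000):
    print(f"{stage:<22} {shape}")
```

The only stage whose size grows quadratically with the sequence length is the attention score tensor; every other stage is linear in `seq_len`.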

jph00

authored 2 papers 5 months ago

The Matrix Calculus You Need For Deep Learning

Paper • 1802.01528 • Published Feb 5, 2018 • 2

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 149

flavoredquark

authored 4 papers 6 months ago

Dockerface: an Easy to Install and Use Faster R-CNN Face Detector in a Docker Container

Paper • 1708.04370 • Published Aug 15, 2017 • 1

Fine-Grained Head Pose Estimation Without Keypoints

Paper • 1710.00925 • Published Oct 2, 2017

Subject-driven Text-to-Image Generation via Apprenticeship Learning

Paper • 2304.00186 • Published Apr 1, 2023

ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning

Paper • 2411.05003 • Published Nov 7, 2024 • 72

not-lain

posted an update 6 months ago

Ever wondered how you can make an API call to a visual-question-answering model without sending an image URL? 👀

You can do that by converting your local image to base64 and sending it to the API.

I recently made some changes to my library "loadimg" that make converting images to base64 a breeze.
🔗 https://github.com/not-lain/loadimg

API request example 🛠️:

from loadimg import load_img
from huggingface_hub import InferenceClient

# Placeholder input: pass a local path, a URL, a Pillow image, or a NumPy array
image_source = "path/to/local_image.png"
my_b64_img = load_img(image_source, output_type="base64")

# Replace the placeholder with your own Hugging Face token
client = InferenceClient(api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx")

messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "Describe this image in one sentence.",
            },
            {
                "type": "image_url",
                "image_url": {
                    # base64 allows using images without uploading them to the web
                    "url": my_b64_img,
                },
            },
        ],
    }
]

stream = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",
    messages=messages,
    max_tokens=500,
    stream=True,
)

# Print tokens as they arrive; some chunks may carry no text
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="")
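
For anyone curious about what the conversion step involves, here is a standard-library sketch of turning a local image file into a base64 data URI. It is not loadimg's implementation, and loadimg's actual output format may differ; the helper name `image_to_data_uri` is hypothetical, and it assumes the receiving API accepts a data URI in the `url` field.

```python
import base64
import mimetypes
from pathlib import Path


def image_to_data_uri(path):
    """Encode a local image file as a base64 data URI."""
    # Infer the MIME type from the file extension (e.g. .png -> image/png)
    mime_type, _ = mimetypes.guess_type(str(path))
    if mime_type is None or not mime_type.startswith("image/"):
        raise ValueError(f"cannot infer an image MIME type from {path!r}")
    encoded = base64.b64encode(Path(path).read_bytes()).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"
```

The resulting string embeds the whole file, so it is roughly a third larger than the original image; very large images may be better resized first.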

louisbrulenaudet

posted an update 6 months ago

I’ve published a new dataset to simplify model merging 🤗

This dataset facilitates the search for compatible architectures for model merging with @arcee_ai’s mergekit, streamlining the automation of high-performance merge searches 📖

Dataset : louisbrulenaudet/mergekit-configs

Alignment-Lab-AI

posted an update 6 months ago

remember boys and girls, always keep all your data, it's never a waste of time!

Team members 41