Timon Käch

CyberTimon

cybertimon

AI & ML interests

ai text generation

Recent Activity

liked a model 3 days ago

Qwen/Qwen2.5-Omni-3B

new activity 4 days ago

Qwen/Qwen3-32B:Feedback: It's a good model, however it hallucinates very badly at local facts (Germany)

new activity 4 days ago

unsloth/Qwen3-30B-A3B-GGUF:`UD-Q4_K_XL` or `Q4_K_M`?

CyberTimon's activity

New activity in Qwen/Qwen3-32B 4 days ago

Feedback: It's a good model, however it hallucinates very badly at local facts (Germany)

#12 opened 4 days ago by

New activity in unsloth/Qwen3-30B-A3B-GGUF 4 days ago

`UD-Q4_K_XL` or `Q4_K_M`?

#6 opened 4 days ago by

New activity in DavidAU/Command-R-01-Ultra-NEO-V1-35B-IMATRIX-GGUF 10 months ago

Amazing, thank you so much. Small question :)

#2 opened 10 months ago by

New activity in TIGER-Lab/Mantis-8B-siglip-llama3 12 months ago

GGUF Quants?

#1 opened 12 months ago by

New activity in cognitivecomputations/dolphin-2.9-llama3-8b-256k about 1 year ago

What is 256k?

#1 opened about 1 year ago by

New activity in Vezora/Mistral-22B-v0.1 about 1 year ago

Brain issues

#4 opened about 1 year ago by

New activity in QQGYLab/ELLA about 1 year ago

I Guess I'll Be "That Guy"... SDXL?

#2 opened about 1 year ago by

New activity in mistral-community/Mixtral-8x22B-v0.1 about 1 year ago

We are working on creating a single 22b from this model

#5 opened about 1 year ago by

New activity in databricks/dbrx-base about 1 year ago

Release weights of smaller Experimental MoE

#12 opened about 1 year ago by

New activity in Lykon/dreamshaper-xl-lightning about 1 year ago

Pattern

#3 opened about 1 year ago by

New activity in TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ about 1 year ago

What's up with TheBloke?

#2 opened about 1 year ago by

New activity in wangfuyun/AnimateLCM about 1 year ago

Not working

#2 opened over 1 year ago by

New activity in vikhyatk/moondream1 over 1 year ago

Quants

#1 opened over 1 year ago by

New activity in openchat/openchat_3.5 over 1 year ago

Hallucinations

#2 opened over 1 year ago by

New activity in DiscoResearch/DiscoLM-70b over 1 year ago

Fix Stop Tokens

#2 opened over 1 year ago by

New activity in CausalLM/72B-preview-llamafied-qwen-llamafy over 1 year ago

Exciting, how can you do that so quickly?

#1 opened over 1 year ago by KnutJaegersberg

New activity in Qwen/Qwen-14B over 1 year ago

It would be useful if someone spent the time to convert the tokenizer to gpt2 HF format

#9 opened over 1 year ago by KnutJaegersberg

New activity in VAGOsolutions/SauerkrautLM-7b-HerO over 1 year ago

Training dataset + Hyperparameters

#1 opened over 1 year ago by

New activity in coqui/XTTS-v2 over 1 year ago

Support for Swiss German - please!

#15 opened over 1 year ago by

New activity in deepseek-ai/deepseek-coder-33b-instruct over 1 year ago

tokenizer.model

#6 opened over 1 year ago by