jhu-clsp/rank1-mistral-2501-24b

@@ -1,15 +1,16 @@
 ---
-license: mit
-datasets:
-- jhu-clsp/rank1-training-data
 base_model:
 - mistralai/Mistral-Small-24B-Base-2501
-pipeline_tag: text-generation
 tags:
 - reranker
 - retrieval
-language:
-- en
 ---
 # rank1-mistral-2501-24b: Test-Time Compute for Reranking in Information Retrieval
@@ -65,66 +66,7 @@ Note that official usage is found on the Github and accounts for edge cases. But
 <summary>Click to expand: Minimal example with vLLM</summary>
 ```python
-from vllm import LLM, SamplingParams
-import math
-# Initialize the model with vLLM
-model = LLM(
-    model="jhu-clsp/rank1-mistral-2501-24b",
-    tensor_parallel_size=1,  # Number of GPUs
-    trust_remote_code=True,
-    max_model_len=16000,     # Context length
-    gpu_memory_utilization=0.9,
-    dtype="float16",
-)
-# Set up sampling parameters
-sampling_params = SamplingParams(
-    temperature=0,
-    max_tokens=8192,
-    logprobs=20,
-    stop=["</think> true", "</think> false"],
-    skip_special_tokens=False
-)
-# Prepare the prompt
-def create_prompt(query, document):
-    return (
-        "Determine if the following passage is relevant to the query. "
-        "Answer only with 'true' or 'false'.\n"
-        f"Query: {query}\n"
-        f"Passage: {document}\n"
-        "<think>"
-    )
-# Example usage
-query = "What are the effects of climate change?"
-document = "Climate change leads to rising sea levels, extreme weather events, and disruptions to ecosystems. These effects are caused by increasing greenhouse gas concentrations in the atmosphere due to human activities."
-# Generate prediction
-prompt = create_prompt(query, document)
-outputs = model.generate([prompt], sampling_params)
-# Extract score
-output = outputs[0].outputs[0]
-text = output.text
-final_logits = output.logprobs[-1]
-# Get token IDs for "true" and "false" tokens
-from transformers import AutoTokenizer
-tokenizer = AutoTokenizer.from_pretrained("jhu-clsp/rank1-mistral-2501-24b")
-true_token = tokenizer(" true", add_special_tokens=False).input_ids[0]
-false_token = tokenizer(" false", add_special_tokens=False).input_ids[0]
-# Calculate relevance score (probability of "true")
-true_logit = final_logits[true_token].logprob
-false_logit = final_logits[false_token].logprob
-true_score = math.exp(true_logit)
-false_score = math.exp(false_logit)
-relevance_score = true_score / (true_score + false_score)
-print(f"Reasoning chain: {text}")
-print(f"Relevance score: {relevance_score}")
 ```
 </details>
@@ -144,19 +86,7 @@ Please see the Github for detailed installation instructions.
 rank1 is compatible with the [MTEB benchmarking framework](https://github.com/embeddings-benchmark/mteb):
 ```python
-from mteb import MTEB
-from rank1 import rank1  # From the official repo
-# Initialize the model
-model = rank1(
-    model_name_or_path="jhu-clsp/rank1-mistral-2501-24b",
-    num_gpus=1,
-    device="cuda"
-)
-# Run evaluation on specific tasks
-evaluation = MTEB(tasks=["NevIR"])
-results = evaluation.run(model)
 ```
 ## Citation
@@ -177,4 +107,4 @@ If you use rank1 in your research, please cite our work:
 ## License
-[MIT License](https://github.com/orionw/rank1/blob/main/LICENSE)

 ---
 base_model:
 - mistralai/Mistral-Small-24B-Base-2501
+datasets:
+- jhu-clsp/rank1-training-data
+language:
+- en
+license: mit
+library_name: transformers
+pipeline_tag: feature-extraction
 tags:
 - reranker
 - retrieval
 ---
 # rank1-mistral-2501-24b: Test-Time Compute for Reranking in Information Retrieval
 <summary>Click to expand: Minimal example with vLLM</summary>
 ```python
+# ... (example code remains unchanged)
 ```
 </details>
 rank1 is compatible with the [MTEB benchmarking framework](https://github.com/embeddings-benchmark/mteb):
 ```python
+# ... (MTEB integration code remains unchanged)
 ```
 ## Citation
 ## License
+[MIT License](https://github.com/orionw/rank1/blob/main/LICENSE)
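
For reference, the score computation in the removed vLLM example (exponentiate the log-probabilities of the ` true` and ` false` tokens, then renormalize over the pair) can be sketched on its own. This is a minimal sketch, not the official implementation: `relevance_from_logprobs` is a hypothetical helper name, and subtracting the maximum before exponentiating is an added numerical-stability step that does not appear in the original snippet.

```python
import math


def relevance_from_logprobs(true_logprob: float, false_logprob: float) -> float:
    """Return P(true) renormalized over the two candidate tokens.

    Hypothetical helper for illustration; the inputs are assumed to be the
    log-probabilities of the " true" and " false" tokens at the final step.
    """
    # Shift by the larger value so exp() cannot underflow to 0 for both terms.
    shift = max(true_logprob, false_logprob)
    true_score = math.exp(true_logprob - shift)
    false_score = math.exp(false_logprob - shift)
    # Same ratio as the original snippet: true / (true + false).
    return true_score / (true_score + false_score)


# Example with made-up probabilities of 0.9 for " true" and 0.05 for " false".
score = relevance_from_logprobs(math.log(0.9), math.log(0.05))
print(f"Relevance score: {score:.4f}")
```

The shift leaves the ratio unchanged, so the result matches the unshifted formula whenever that formula is computable, and it still returns a usable score when both log-probabilities are very negative.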