Update README.md

README.md CHANGED

@@ -177,11 +177,6 @@ print(f"Peak Memory Usage: {mem:.02f} GB")
 # Model Performance
-Need to install vllm nightly to get some recent changes
-```
-pip install vllm --pre --extra-index-url https://wheels.vllm.ai/nightly
-```
 ## Results (H100 machine)
 | Benchmark                        |                |                          |
 |----------------------------------|----------------|--------------------------|
@@ -199,6 +194,11 @@ Download sharegpt dataset: `wget https://huggingface.co/datasets/anon8231489123/
 Other datasets can be found in: https://github.com/vllm-project/vllm/tree/main/benchmarks
 ## benchmark_latency
+Need to install vllm nightly to get some recent changes
+```
+pip install vllm --pre --extra-index-url https://wheels.vllm.ai/nightly
+```
 Run the following under `vllm` source code root folder:
 ### baseline